Intelligent Control Lab.
Planning, Reaction and Learning Applied to a Real Mobile Robot

Sutton's DYNA algorithm, which integrates planning, reaction and reinforcement learning, was implemented at ISR on the Robosoft ROBUTER mobile platform by Alex Weiser, a former Technische Universität München undergraduate student who did his final project at ISR/IST, supported by an ERASMUS grant. The platform uses odometry to localize itself inside a well-structured world of obstacles and empty cells. Starting with no prior knowledge of the world, it tries to reach a goal destination by trial and error. A reward is received if and only if the goal is reached. After reaching the goal for the first time, the robot learns a path from start to goal while it keeps building a limited world model, using both real experiences and experiences simulated with the world model.
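The loop described above (act in the real world, learn from the real transition, then replay transitions from the learned model) can be illustrated with a minimal tabular Dyna-Q sketch. This is not the code that ran on the ROBUTER: the grid layout, the parameter values and all function names (`step`, `dyna_q`, `greedy_path`) are assumptions made for illustration, and Dyna-Q is used here as one common variant of Sutton's DYNA architecture.

```python
import random

# Hypothetical world: 'S' start, 'G' goal, '#' obstacle, '.' empty cell.
GRID = ["S..#.",
        ".#...",
        ".#.#.",
        "...#G"]
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def find(symbol):
    """Return the (row, column) of the first cell holding `symbol`."""
    for r, row in enumerate(GRID):
        c = row.find(symbol)
        if c >= 0:
            return (r, c)
    raise ValueError(symbol)


def step(state, action):
    """Real-world transition: a blocked move leaves the robot in place.
    The reward is 1 only when the goal cell is entered, 0 otherwise."""
    r = state[0] + ACTIONS[action][0]
    c = state[1] + ACTIONS[action][1]
    inside = 0 <= r < len(GRID) and 0 <= c < len(GRID[0])
    if not inside or GRID[r][c] == "#":
        r, c = state
    return (r, c), (1.0 if GRID[r][c] == "G" else 0.0)


def dyna_q(episodes=50, planning_steps=20, alpha=0.5, gamma=0.95,
           epsilon=0.1, seed=0):
    rng = random.Random(seed)
    start, goal = find("S"), find("G")
    q = {}      # (state, action) -> estimated value
    model = {}  # (state, action) -> (next_state, reward), built from experience

    def value(s):
        return max(q.get((s, b), 0.0) for b in range(len(ACTIONS)))

    def best(s):
        # Greedy action with random tie-breaking (all ties at the start).
        top = value(s)
        return rng.choice([a for a in range(len(ACTIONS))
                           if q.get((s, a), 0.0) == top])

    def update(s, a, s2, r):
        # One-step Q-learning backup; the goal is terminal.
        target = r if s2 == goal else r + gamma * value(s2)
        old = q.get((s, a), 0.0)
        q[(s, a)] = old + alpha * (target - old)

    for _ in range(episodes):
        s = start
        while s != goal:
            explore = rng.random() < epsilon
            a = rng.randrange(len(ACTIONS)) if explore else best(s)
            s2, r = step(s, a)          # reaction: act in the real world
            update(s, a, s2, r)         # learning: from the real experience
            model[(s, a)] = (s2, r)     # extend the limited world model
            for _ in range(planning_steps):
                # planning: replay a remembered transition from the model
                ps, pa = rng.choice(list(model))
                ps2, pr = model[(ps, pa)]
                update(ps, pa, ps2, pr)
            s = s2
    return q, start, goal


def greedy_path(q, start, goal, max_len=50):
    """Follow the highest-valued action from start until the goal."""
    path, s = [start], start
    while s != goal and len(path) < max_len:
        a = max(range(len(ACTIONS)), key=lambda b: q.get((s, b), 0.0))
        s, _ = step(s, a)
        path.append(s)
    return path
```

Calling `q, start, goal = dyna_q()` and then `greedy_path(q, start, goal)` returns the sequence of cells the learned policy visits. Setting `planning_steps=0` reduces the sketch to plain Q-learning, which makes it easy to compare learning with and without the world model.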
An extension to symmetric (the robot learns on both the start-goal and goal-start paths) and cooperative (two robots share the world map and communicate policy information about it) reinforcement learning was made by Sjoerd Van der Zwaan and José Moreira as their project for the MSc Mobile Robotics course. They use external global vision to localize the LEGO robots (picture on the left) and external processing to tell them where to move next.
A 4-minute MPEG video of the robots moving through the maze is available here.

Publications

"Cooperative Learning and Planning for Multiple Robots", Sjoerd Van der Zwaan, José Moreira, Pedro Lima, submitted to ISIC'2000, (7 pages) --gzipped postscript file (163 Kbytes)
"An Integrated Architecture for Learning, Planning and Reacting Applied to a Real Mobile Robot", Alex Weiser, Pedro Lima, ISR Internal Report RT-401-95 (173 pages) --gzipped postscript file (214 Kbytes)
"An Integrated Learning, Planning and Reacting Algorithm Applied to a Real Mobile Robot", Alex Weiser, Pedro Lima, in Proc. of CONTROLO 96, (6 pages) --gzipped postscript file (56 Kbytes)

Last modified: 16:01 15-September-2002