Active Learning in Robotics: Exploration Strategies in Complex Environments
Conference: | Humanoids 2014 |
Location: | Madrid, Spain |
Date: | November 18th, 2014 |
Organizers: | Johannes Kulick, Herke van Hoof, Marc Toussaint, Jan Peters |
Summary and Objectives
Robotics has achieved tremendous results: Robots can drive cars, precisely build goods at industrial scale, and perform surgery. The design of agents that can learn such complex skills by themselves is still in its infancy. One bottleneck is the lack of labeled data. Datasets are often limited in size and coverage, since they need experiments on robots or expensive simulations. Thus an important capability of every robot in complex environments is to gather new data.
To do this efficiently, intelligent robots need to be able to identify which task-relevant information is still missing. Only this ability enables agents to actively work towards a better understanding of the environment, the agent's state, and the task at hand, which will eventually lead to better performance.
Active learning is an approach to gain this ability. Within this paradigm the agent chooses the next datapoint to achieve the best learning result with as few data points as possible. Such strategies can shorten the process of data gathering significantly and also lead to the important information more quickly. In robotics, active learning approaches can be applied on various levels, from low-level tasks such as motor control learning to higher level reasoning tasks.
In this workshop, we want to discuss the state of the art of active learning in robotics, but also address important questions that are still open. The topics discussed in this workshop include (but are not limited to):
- What representation of knowledge allows efficient reasoning about it?
- What are successful strategies based on these representations?
- How can we generalize or transfer experiences to decrease the need of data?
- How to explore safely without damaging the robot and its environment?
- How can existing approaches be `scaled up' to large, continuous real world domains?
Intended Audience
The workshop is intended for all scientists conducting research in robotics or machine learning facing the problem of little data in robotics and those interested in exploration strategies for robots in novel situations.
Confirmed Speakers
We are happy to announce the following speakers have confirmed their participation in the workshop:
- Andreas Krause (ETH Zürich)
- Ruben Martinez-Cantin (Centro Universitario de la Defensa)
- Jivko Sinapov (University of Texas at Austin)
- Clement Moulin-Frier (Flowers Laboratory, INRIA Bordeaux)
Contributed Posters
The following contributed posters were accepted:
- Nicholas Kirk: Towards Learning Object Affordance Priors from Technical Texts
- Chang Wang, Koen V. Hindriks and Robert Babuska: Active Affordance Learning in Continuous State and Action Spaces
- Daisuke Tanaka, Takamitsu Matsubara, and Kenji Sugimoto: Optimal Control Approach for Active Exploratory Action Design with Object Manifold Learning
- Christian Daniel, Malte Viering, Jan Metz, Oliver Kroemer, Jan Peters: Learning Reward Functions for Efficient Robot Learning using Bayesian Optimization
Schedule
The tentative workshop schedule is as follows:
09:00 - 09:10 | Opening |
09:10 - 09:45 | Jivko Sinapov: Grounding Object Concepts in Exploratory Behaviors |
09:45 - 10:15 | Contributed paper spotlights |
10:15 - 10:50 | Clement Moulin-Frier: Exploration strategies in Developmental Robotics: a unified framework |
10:50 - 11:15 | Interactive session |
11:15 - 11:40 | Coffee break |
11:40 - 12:15 | Ruben Martinez-Cantin: Bayesian optimization for active policy search |
12:15 - 12:50 | Andreas Krause: Active learning for decision making |
12:50 - 13:00 | Conclusion |
Organizers
- Johannes Kulick, University of Stuttgart
- Herke van Hoof, Technische Universität Darmstadt
- Marc Toussaint, University of Stuttgart
- Jan Peters, Technische Universität Darmstadt
Acknowledgement
The workshop receives funding from the DFG (German Science Foundation) within the priority programm "Autonomous Learning" (SPP 1527). The workshop organizers acknowledge support from the European Community’s Seventh Framework Programmes (FP7-ICT-2013-10) under grant agreement 610878 (3rdHand), 610967 (TACMAN), and 270327 (CompLACS).