Imitation Learning, Skill Learning, Movement Primitive Representation, Human-Robot Interaction, Movement Segmentation, Grammar Induction Reinforcement Learning, Robot Learning
TU Darmstadt, FG IAS,
Hochschulstr. 10, 64289 Darmstadt
Office. Room E325, Building S2|02
The Probabilisitic Segmentation (ProbS) method segments initially unlabeled demonstrations are segmented into sequences of movement primitives, while simultaneously learning a library of primitives. ProbS is an Expectation Maximization method, where the E-step computes a new segmentation of the demonstrations given the current model and the M-step learns a new model given the current segmentation. The model is defined as a mixture of primitives. The segmented demonstrations and the learned primitive library are used to induce a Probabilistic Context-Free Grammar for primitive sequencing. The grammar structure is learned using an Markov chain Monte Carlo optimization, where a novel prior based on three Poisson distributions was introduced. The induced grammar serves as an easily comprehensible representation of the capability of a robot with respect to the primitives learned from the demonstrations. The grammar can now be used to generate new sequences of primitives from the library. Finally, the induced grammar is improved through reinforcement learning approaches in order to improve the grammar such that previously unseen tasks can be solved by the grammar and the primitive library.
Before his Ph.D., Rudolf completed his Master Degree in Computer Science at the Technische Universitaet Darmstadt focusing on robotics and machine learning with a minor in bionics. In his masters thesis Rudolf investigated robust and safe policy search methods under the supervision of Gerhard Neumann and Jan Peters. He developed a new policy update approach entitled Information Theoretic Stochastic Optimal Control.
Rudolf is also looks into approaches for anticipatory robot behavior, active learning for human-robot interactions, infusing semantic meaning into learned movement grammars, task and role identification in collaborative human-robot scenarios and robot-assisted learning of human motor skills. Another big interest of Rudolf is the application of machine and robot-learning methods in prosthetic devices. Such intelligent prostheses could improve the adaptation process, the capabilities and the quality of life of the respective person.
Imitation Learning, Reinforcement Learning, Policy Search, Movement Primitive Representation, Skill Acquisition, Movement Segmentation, Structure Learning, Grammar Induction, Skill Composition and Sequencing, Life-Long Learning, Active Learning
Anthropomorphic Robots, Human-Robot Interaction, Semi-Autonomy, Motor Skills, Adaptive Control, Human-In-The-Loop, Robot Learning