Davide Tateo

Quick Info

Research Interests

Reinforcement Learning; Robotics; Deep Reinforcement Learning

More Information

Google Scholar | Curriculum Vitae

Contact Information

Mail. Davide Tateo
TU Darmstadt, Fachgebiet IAS
Hochschulstraße 10
64289 Darmstadt
Office. Room E303,
Robert-Piloty-Gebäude S2|02
Phone. +49-6151-16-20811

Davide Tateo is a postdoctoral researcher and leader of the Safe and Reliable Robot Learning research group in the Intelligent Autonomous Systems group. He joined the lab in April 2019 after receiving his Ph.D. in Information Technology from Politecnico di Milano (Milan, Italy) in February 2019.

The main goal of his research group is to develop learning algorithms that can be deployed on real systems. To achieve this objective, the group focuses on fundamental properties of learning algorithms, such as acting under (safety) constraints.

Currently, he is involved in a wide variety of projects: the collaborative KIARA project, which brings advanced manipulation skills to risky scenarios; the DeepWalking project, which learns human gaits from demonstrations; and the INTENTION project, which develops legged robot locomotion exploiting active perception techniques. He previously worked on the SKILLS4ROBOTS project, whose objective was to develop humanoid robots that can acquire and improve a rich set of motor skills.

During his Ph.D. research, Davide worked under the supervision of Prof. Andrea Bonarini and Prof. Marcello Restelli, focusing in particular on Hierarchical and Inverse Reinforcement Learning. He also co-developed MushroomRL, a Python Reinforcement Learning library.

Software

  • MushroomRL: A Python Reinforcement Learning library, developed by Carlo D'Eramo and me, that provides both a clear interface to various benchmarking environments and simulators and implementations of many classical and deep reinforcement learning algorithms.

Key references

Liu, P.; Zhang, K.; Tateo, D.; Jauhri, S.; Hu, Z.; Peters, J.; Chalvatzaki, G. (2023). Safe Reinforcement Learning of Dynamic High-Dimensional Robotic Tasks: Navigation, Manipulation, Interaction, 2023 IEEE International Conference on Robotics and Automation (ICRA), IEEE.   Download Article [PDF]   BibTeX Reference [BibTex]

Al-Hafez, F.; Tateo, D.; Arenz, O.; Zhao, G.; Peters, J. (2023). LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning, International Conference on Learning Representations (ICLR).   Download Article [PDF]   BibTeX Reference [BibTex]

Urain, J.; Tateo, D.; Peters, J. (2023). Learning Stable Vector Fields on Lie Groups, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), IEEE R-AL Track.   BibTeX Reference [BibTex]

Bjelonic, F.; Lee, J.; Arm, P.; Sako, D.; Tateo, D.; Peters, J.; Hutter, M. (2023). Learning-Based Design and Control for Quadrupedal Robots With Parallel-Elastic Actuators, IEEE Robotics and Automation Letters, 8, 3, pp.1611-1618.   Download Article [PDF]   BibTeX Reference [BibTex]

Parisi, S.; Tateo, D.; Hensel, M.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2022). Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning, Algorithms, 15, 3, pp.81.   Download Article [PDF]   BibTeX Reference [BibTex]

Akrour, R.; Tateo, D.; Peters, J. (2022). Continuous Action Reinforcement Learning from a Mixture of Interpretable Experts, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 44, 10, pp.6795-6806.   Download Article [PDF]   BibTeX Reference [BibTex]

Memmel, M.; Liu, P.; Tateo, D.; Peters, J. (2022). Dimensionality Reduction and Prioritized Exploration for Policy Search, 25th International Conference on Artificial Intelligence and Statistics (AISTATS).   Download Article [PDF]   BibTeX Reference [BibTex]

Liu, P.; Zhang, K.; Tateo, D.; Jauhri, S.; Peters, J.; Chalvatzaki, G. (2022). Regularized Deep Signed Distance Fields for Reactive Motion Generation, 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).   Download Article [PDF]   BibTeX Reference [BibTex]

Urain, J.; Tateo, D.; Peters, J. (2022). Learning Stable Vector Fields on Lie Groups, Robotics and Automation Letters (RA-L).   Download Article [PDF]   BibTeX Reference [BibTex]

Carvalho, J.; Tateo, D.; Muratore, F.; Peters, J. (2021). An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients, International Joint Conference on Neural Networks (IJCNN).   Download Article [PDF]   BibTeX Reference [BibTex]

D'Eramo, C.; Tateo, D.; Bonarini, A.; Restelli, M.; Peters, J. (2021). MushroomRL: Simplifying Reinforcement Learning Research, Journal of Machine Learning Research (JMLR).   BibTeX Reference [BibTex]

Liu, P.; Tateo, D.; Bou-Ammar, H.; Peters, J. (2021). Efficient and Reactive Planning for High Speed Robot Air Hockey, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).   Download Article [PDF]   BibTeX Reference [BibTex]

Liu, P.; Tateo, D.; Bou-Ammar, H.; Peters, J. (2021). Robot Reinforcement Learning on the Constraint Manifold, Proceedings of the Conference on Robot Learning (CoRL).   Download Article [PDF]   BibTeX Reference [BibTex]

D'Eramo, C.; Tateo, D.; Bonarini, A.; Restelli, M.; Peters, J. (2020). Sharing Knowledge in Multi-Task Deep Reinforcement Learning, International Conference on Learning Representations (ICLR).   Download Article [PDF]   BibTeX Reference [BibTex]

Urain, J.; Ginesi, M.; Tateo, D.; Peters, J. (2020). ImitationFlow: Learning Deep Stable Stochastic Dynamic Systems by Normalizing Flows, IEEE/RSJ International Conference on Intelligent Robots and Systems.   Download Article [PDF]   BibTeX Reference [BibTex]

Urain, J.; Tateo, D.; Ren, T.; Peters, J. (2020). Structured policy representation: Imposing stability in arbitrarily conditioned dynamic systems, NeurIPS 2020, 3rd Robot Learning Workshop, pp.7.   Download Article [PDF]   BibTeX Reference [BibTex]

Tateo, D. (2019). Building structured hierarchical agents, Ph.D. Thesis.   Download Article [PDF]   BibTeX Reference [BibTex]

Beretta, C.; Brizzolari, C.; Tateo, D.; Riva, A.; Amigoni, F. (2019). A Sampling-Based Algorithm for Planning Smooth Nonholonomic Paths, European Conference on Mobile Robots (ECMR).   Download Article [PDF]   BibTeX Reference [BibTex]

Tateo, D.; Erdenlig, I. S.; Bonarini, A. (2019). Graph-Based Design of Hierarchical Reinforcement Learning Agents, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE.   Download Article [PDF]   BibTeX Reference [BibTex]

Akrour, R.; Tateo, D.; Peters, J. (2019). Towards Reinforcement Learning of Human Readable Policies, ECML/PKDD Workshop on Deep Continuous-Discrete Machine Learning.   Download Article [PDF]   BibTeX Reference [BibTex]

Tateo, D.; Banfi, J.; Riva, A.; Amigoni, F.; Bonarini, A. (2018). Multiagent Connected Path Planning: PSPACE-Completeness and How to Deal with It, Thirty-Second AAAI Conference on Artificial Intelligence (AAAI2018), pp.4735-4742.   Download Article [PDF]   BibTeX Reference [BibTex]

Tateo, D.; D'Eramo, C.; Nuara, A.; Bonarini, A.; Restelli, M. (2017). Exploiting structure and uncertainty of Bellman updates in Markov decision processes, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).   Download Article [PDF]   BibTeX Reference [BibTex]

Tateo, D.; Pirotta, M.; Restelli, M.; Bonarini, A. (2017). Gradient-based minimization for multi-expert Inverse Reinforcement Learning, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).   Download Article [PDF]   BibTeX Reference [BibTex]
