Firas Al-Hafez

Research Interests
Reinforcement Learning, Imitation Learning, Robot Learning, Locomotion.
Affiliation
TU Darmstadt, Intelligent Autonomous Systems, Computer Science Department
Contact
firas.al-hafez@tu-darmstadt.de
Room E303, Building S2|02, TU Darmstadt, FB-Informatik, FG-IAS, Hochschulstr. 10, 64289 Darmstadt
+49-6151-16-20811
Firas joined the Intelligent Autonomous System lab on November, 1st, 2021 as a PhD student. He focuses on neuromuscular gait control using inverse reinforcement learning with application on exoskeletons and humanoids. He works with Davide Tateo and Jan Peters within IAS and its research group Safe and Reliable Robot Learning. Previously, he received MSc degrees in Electronic Systems and Mechanical Engineering, as well as a BSc degree in Industrial Engineering field Mechanical Engineering from TU Braunschweig.
During his studies, he completed several applied projects, including a master's thesis on the "Comparison of Reinforcement Learning Algorithms and Evolution Strategies on Robot Manipulation Tasks" at the IRP under the supervision of Jochen Steil and a master's thesis on the "Development of a Tactical Maneuver Planner for Automated Driving Systems in Urban Areas using Reinforcement Learning" at the Volkswagen Group.
Publications
- Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2024). Time-Efficient Reinforcement Learning with Stochastic Stateful Policies, International Conference on Learning Representations (ICLR).
- Geiss, H.J.; Al-Hafez, F.; Seyfarth, A.; Peters, J.; Tateo, D. (2024). Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion, IEEE-RAS International Conference on Humanoid Robots (Humanoids).
- Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2024). Time-Efficient Reinforcement Learning with Stochastic Stateful Policies, European Workshop on Reinforcement Learning (EWRL).
- Kaidanov, O.; Al-Hafez, F.; Süvari, Y.; Belousov, B.; Peters, J. (2024). The Role of Domain Randomization in Training Diffusion Policies for Whole-Body Humanoid Control, CoRL 2024 Workshop on Whole-body Control and Bimanual Manipulation: Applications in Humanoids and Beyond.
- Al-Hafez, F.; Tateo, D.; Arenz, O.; Zhao, G.; Peters, J. (2023). LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning, International Conference on Learning Representations (ICLR).
- Al-Hafez, F.; Tateo, D.; Arenz, O.; Zhao, G.; Peters, J. (2023). Least Squares Inverse Q-Learning, European Workshop on Reinforcement Learning (EWRL).
- Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2023). LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion, Robot Learning Workshop, Conference on Neural Information Processing Systems (NeurIPS).
- Al-Hafez, F.; Steil, J. (2021). Redundancy Resolution as Action Bias in Policy Search for Robotic Manipulation, 5th Conference on Robot Learning (CoRL).