Firas Al-Hafez

Research Interests

Reinforcement Learning, Imitation Learning, Robot Learning, Locomotion.

Affiliation

TU Darmstadt, Intelligent Autonomous Systems, Computer Science Department

Contact

firas.al-hafez@tu-darmstadt.de
Room E303, Building S2|02, TU Darmstadt, FB-Informatik, FG-IAS, Hochschulstr. 10, 64289 Darmstadt
+49-6151-16-20811

Firas joined the Intelligent Autonomous System lab on November, 1st, 2021 as a PhD student. He focuses on neuromuscular gait control using inverse reinforcement learning with application on exoskeletons and humanoids. He works with Davide Tateo and Jan Peters within IAS and its research group Safe and Reliable Robot Learning. Previously, he received MSc degrees in Electronic Systems and Mechanical Engineering, as well as a BSc degree in Industrial Engineering field Mechanical Engineering from TU Braunschweig.

During his studies, he completed several applied projects, including a master's thesis on the "Comparison of Reinforcement Learning Algorithms and Evolution Strategies on Robot Manipulation Tasks" at the IRP under the supervision of Jochen Steil and a master's thesis on the "Development of a Tactical Maneuver Planner for Automated Driving Systems in Urban Areas using Reinforcement Learning" at the Volkswagen Group.

Publications

Bib
Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2024). Time-Efficient Reinforcement Learning with Stochastic Stateful Policies, International Conference on Learning Representations (ICLR).
Bib
Geiss, H.J.; Al-Hafez, F.; Seyfarth, A.; Peters, J.; Tateo, D. (2024). Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid Locomotion, IEEE-RAS International Conference on Humanoid Robots (Humanoids).
Bib
Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2024). Time-Efficient Reinforcement Learning with Stochastic Stateful Policies, European Workshop on Reinforcement Learning (EWRL).
Bib
Kaidanov, O.; Al-Hafez, F.; Süvari, Y.; Belousov, B.; Peters, J. (2024). The Role of Domain Randomization in Training Diffusion Policies for Whole-Body Humanoid Control, CoRL 2024 Workshop on Whole-body Control and Bimanual Manipulation: Applications in Humanoids and Beyond.
Bib
Al-Hafez, F.; Tateo, D.; Arenz, O.; Zhao, G.; Peters, J. (2023). LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning, International Conference on Learning Representations (ICLR).
Bib
Al-Hafez, F.; Tateo, D.; Arenz, O.; Zhao, G.; Peters, J. (2023). Least Squares Inverse Q-Learning, European Workshop on Reinforcement Learning (EWRL).
Bib
Al-Hafez, F.; Zhao, G.; Peters, J.; Tateo, D. (2023). LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion, Robot Learning Workshop, Conference on Neural Information Processing Systems (NeurIPS).
Bib
Al-Hafez, F.; Steil, J. (2021). Redundancy Resolution as Action Bias in Policy Search for Robotic Manipulation, 5th Conference on Robot Learning (CoRL).

# (:titlesearch alhafez)