Anish Diwan

Research Interests
- Reinforcement Learning
- Imitation Learning
- Learning Under Partial Observability
- Score-Based Generative Models
Contact
Room E226, Building S2|02
Hochschulstr. 10, 64289 Darmstadt
anish.diwan@tu-darmstadt.de
anish.diwan@robot-learning.de
Anish Diwan joined the Intelligent Autonomous Systems Group as a Ph.D. student in April 2025. Currently, he works on imitation learning and reinforcement learning under partial observability, and exploring techniques like score-based generative models for learning data-driven reward functions.
Before his PhD, Anish Diwan completed his Master Degree in Robotics at the Technische Universiteit Delft with Cum Laude honours. His thesis entitled “Noise-conditioned Energy-based Annealed Rewards" was written under the supervision of Prof. Jens Kober , Prof. Jan Peters, and Julen Urain.
Research Interests
Reinforcement Learning, Imitation Learning, Learning Under Partial Observability, Score-Based Generative Models
Key References
-
- Diwan, A.A.; Urain, J.; Kober, J.; Peters, J. (2025). Noise-conditioned Energy-based Annealed Rewards (NEAR): A Generative Framework for Imitation Learning from Observation, International Conference on Learning Representations (ICLR).