Machine Learning, Robotics, Reinforcement Learning
Personal Website Publications Google Scholar Research Gate
Mail.
João Carvalho
TU Darmstadt, FG-IAS
Hochschulstr. 10, 64289 Darmstadt
Office. Room E225, Building S2|02
+49-6151-16-20073
joao.correia_carvalho@tu-darmstadt.de
His master's thesis entitled "Nonparametric Off-Policy Policy Gradient" was written at IAS, supervised by Samuele Tosatto, and explored an approach to obtain an off-policy gradient update with better sample-efficiency.