Samuele Tosatto's Publications

Tosatto, S.; Carvalho, J.; Peters, J. (in press). Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI).   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; Akrour, R.; Peters, J. (2021). An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions, Stats, 4, pp.1--17.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S. (2021). Off-Policy Reinforcement Learning for Robotics, PhD Thesis.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; Chalvatzaki, G.; Peters, J. (2021). Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA).   BibTeX Reference [BibTex]

Tosatto, S.; Carvalho, J.; Abdulsamad, H.; Peters, J. (2020). A Nonparametric Off-Policy Policy Gradient, Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS).   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; Stadtmueller, J.; Peters, J. (2020). Dimensionality Reduction of Movement Primitives in Parameter Space, arXiv.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; Akrour, R.; Peters, J. (2020). An Upper Bound of the Bias of Nadaraya–Watson Kernel Regression under Lipschitz Assumptions, MDPI Stats, 4, pp.1--17.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; D'Eramo, C.; Pajarinen, J.; Restelli, M.; Peters, J. (2019). Exploration Driven By an Optimistic Bellman Equation, Proceedings of the International Joint Conference on Neural Networks (IJCNN).   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; D'Eramo, C.; Pajarinen, J.; Restelli, M.; Peters, J. (2018). Technical Report: Exploration Driven by an Optimistic Bellman Equation.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; D'Eramo, C.; Pirotta, M.; Restelli, M. (2017). Boosted Fitted Q-Iteration, Polytechnic University of Milan.   Download Article [PDF]   BibTeX Reference [BibTex]

Tosatto, S.; Pirotta, M.; D'Eramo, C.; Restelli, M. (2017). Boosted Fitted Q-Iteration, Proceedings of the International Conference on Machine Learning (ICML).   Download Article [PDF]   BibTeX Reference [BibTex]

Rueckert, E.; Nakatenus, M.; Tosatto, S.; Peters, J. (2017). Learning Inverse Dynamics Models in O(n) time with LSTM networks, Proceedings of the International Conference on Humanoid Robots (HUMANOIDS).   Download Article [PDF]   BibTeX Reference [BibTex]

  
