Samuele Tosatto's Publications

  •
    Tosatto, S.; Carvalho, J.; Peters, J. (2022). Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 44, 10, pp. 5996–6010.
  •
    Tosatto, S.; Akrour, R.; Peters, J. (2021). An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions, Stats, 4, pp. 1–17.
  •
    Tosatto, S. (2021). Off-Policy Reinforcement Learning for Robotics, PhD Thesis.
  •
    Tosatto, S.; Chalvatzaki, G.; Peters, J. (2021). Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA).
  •
    Tosatto, S.; Carvalho, J.; Abdulsamad, H.; Peters, J. (2020). A Nonparametric Off-Policy Policy Gradient, Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS).
  •
    Tosatto, S.; Stadtmueller, J.; Peters, J. (2020). Dimensionality Reduction of Movement Primitives in Parameter Space, arXiv.
  •
    Tosatto, S.; Akrour, R.; Peters, J. (2020). An Upper Bound of the Bias of Nadaraya–Watson Kernel Regression under Lipschitz Assumptions, MDPI Stats, 4, pp. 1–17.
  •
    Tosatto, S.; D'Eramo, C.; Pajarinen, J.; Restelli, M.; Peters, J. (2019). Exploration Driven by an Optimistic Bellman Equation, Proceedings of the International Joint Conference on Neural Networks (IJCNN).
  •
    Tosatto, S.; D'Eramo, C.; Pajarinen, J.; Restelli, M.; Peters, J. (2018). Technical Report: Exploration Driven by an Optimistic Bellman Equation.
  •
    Tosatto, S.; D'Eramo, C.; Pirotta, M.; Restelli, M. (2017). Boosted Fitted Q-Iteration, Polytechnic University of Milan.
  •
    Tosatto, S.; Pirotta, M.; D'Eramo, C.; Restelli, M. (2017). Boosted Fitted Q-Iteration, Proceedings of the International Conference on Machine Learning (ICML).
  •
    Rueckert, E.; Nakatenus, M.; Tosatto, S.; Peters, J. (2017). Learning Inverse Dynamics Models in O(n) time with LSTM networks, Proceedings of the International Conference on Humanoid Robots (HUMANOIDS).