Intelligent Autonomous Systems | Team / Publications: Joni Pajarinen

Journal Papers
Bib Klink, P.; Wolf, F.; Ploeger, K.; Peters, J.; Pajarinen, J. (submitted). Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning, Submitted to the IEEE Transactions on Robotics (T-Ro). Bib Rother, D.; Herbert, F.; Kalter, F.; Koert, D.; Pajarinen, J.; Peters, J.; Weisswange, T.H. (2025). Entropy based blending of policies for multi-agent coexistence., Autonomous Agents Multi Agent Systems, 39, pp.27. Bib Dam, T.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2024). A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search, Journal of Artificial Intelligence Research (JAIR), 81, pp.511-577. Bib Klink, P.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2024). On the Benefit of Optimal Transport for Curriculum Reinforcement Learning, IEEE Transaction on Pattern Analysis and Machine Intelligence (PAMI), 46, 11, pp.7191 - 7204, IEEE. Bib Weng, Y.; Chun, S.; Ohashi, M.; Matsuda, T.; Sekimoria, Y.; Pajarinen, J.; Peters, J.; Maki, T. (2024). Autonomous Underwater Vehicle Link Alignment Control in Unknown Environments Using Reinforcement Learning, Journal of Field Robotics, 41, 6, pp.1724--1743. Bib Parisi, S.; Tateo, D.; Hensel, M.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2022). Long-Term Visitation Value for Deep Exploration in Sparse-Reward Reinforcement Learning, Algorithms, 15, 3, pp.81. Bib Weng, Y.; Pajarinen, J.; Akrour, R.; Matsuda, T.; Peters, J.; Maki, T. (2022). Reinforcement Learning Based Underwater Wireless Optical Communication Alignment for Multiple Autonomous Underwater Vehicles, IEEE Journal of Oceanic Engineering, 47, 4, pp.1231-1245. Bib Weng, Y.; Matsuda, T.; Sekimuri, Y.; Pajarinen, J.; Peters, J.; Maki, T. (2022). Pointing Error Control of Underwater Wireless Optical Communication on Mobile Platform, IEEE Photonics Technology Letters, 34, 13, pp.699-702. Bib Weng, Y.; Matsuda, T.; Sekimoria, Y.; Pajarinen, J.; Peters, J.; Maki, T. (2022). Establishment of Line-of-Sight Optical Links Between Autonomous Underwater Vehicles: Field Experiment and Performance Validation, Applied Ocean Research, 129. Bib Dam, T.; Chalvatzaki, G.; Peters, J.; Pajarinen, J. (2022). Monte-carlo robot path planning, IEEE Robotics and Automation Letters (RA-L), 7, 4, pp.11213-11220. Bib Klink, P.; Abdulsamad, H.; Belousov, B.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2021). A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning, Journal of Machine Learning Research (JMLR). Bib Pajarinen, J.; Arenz, O.; Peters, J.; Neumann, N. (2020). Probabilistic approach to physical object disentangling, IEEE Robotics and Automation Letters (RA-L). Bib Lauri, M.; Pajarinen, J.; Peters, J.; Frintrop, S. (2020). Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization, IEEE Robotics and Automation Letters (RA-L), 5, 4, pp.5323-5330. Bib Pajarinen, J.; Thai, H.L.; Akrour, R.; Peters, J.; Neumann, G. (2019). Compatible natural gradient policy search, Machine Learning (MLJ), 108, 8, pp.1443--1466, Springer. Bib Koert, D.; Pajarinen, J.; Schotschneider, A.; Trick, S., Rothkopf, C.; Peters, J. (2019). Learning Intention Aware Online Adaptation of Movement Primitives, IEEE Robotics and Automation Letters (RA-L), with presentation at the IEEE International Conference on Intelligent Robots and Systems (IROS). Bib Osa, T.; Pajarinen, J.; Neumann, G.; Bagnell, J.A.; Abbeel, P.; Peters, J. (2018). An Algorithmic Perspective on Imitation Learning, Foundations and Trends in Robotics. Bib Pajarinen, J.; Kyrki, V. (2015). Robotic manipulation of multiple objects as a POMDP, Artificial Intelligence. Bib Pajarinen, J.; Hottinen, A.; Peltonen, J. (2013). Optimizing Spatial and Temporal Reuse in Wireless Networks by Decentralized Partially Observable Markov Decision Processes, IEEE Transactions on Mobile Computing. Bib Pajarinen, J.; Peltonen, J.; Uusitalo, M.A. (2011). Fault tolerant machine learning for nanoscale cognitive radio, Neurocomputing.

Conference Papers
Bib Klink, P.; D`Eramo, C.; Peters, J.; Pajarinen, J. (2022). Boosted Curriculum Reinforcement Learning, International Conference on Learning Representations (ICLR). Bib Klink, P.; Yang, H.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2022). Curriculum Reinforcement Learning via Constrained Optimal Transport, International Conference on Machine Learning (ICML). Bib Watson, J.; Lin J. A.; Klink, P.; Pajarinen, J.; Peters, J. (2021). Latent Derivative Bayesian Last Layer Networks, Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS). Bib Dam, T.; Klink, P.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2020). Generalized Mean Estimation in Monte-Carlo Tree Search, Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI). Bib Laux, M.; Arenz, O.; Pajarinen, J.; Peters, J. (2020). Deep Adversarial Reinforcement Learning for Object Disentangling, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020). Bib Klink, P.; D'Eramo, C.; Peters, J.; Pajarinen, J. (2020). Self-Paced Deep Reinforcement Learning, Advances in Neural Information Processing Systems (NIPS / NeurIPS). Bib Lauri, M.; Pajarinen, J.; Peters, J. (2019). Information gathering in decentralized POMDPs by policy graph improvement, Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS). Bib Tosatto, S.; D'Eramo, C.; Pajarinen, J.; Restelli, M.; Peters, J. (2019). Exploration Driven By an Optimistic Bellman Equation, Proceedings of the International Joint Conference on Neural Networks (IJCNN). Bib Akrour, R.; Pajarinen, J.; Neumann, G.; Peters, J. (2019). Projections for Approximate Policy Iteration Algorithms, Proceedings of the International Conference on Machine Learning (ICML). Bib Hoelscher, J.; Koert, D.; Peters, J.; Pajarinen, J. (2018). Utilizing Human Feedback in POMDP Execution and Specification, Proceedings of the International Conference on Humanoid Robots (HUMANOIDS). Bib Pajarinen, J.; Kyrki, V.; Koval, M.; Srinivasa, S; Peters, J.; Neumann, G. (2017). Hybrid Control Trajectory Optimization under Uncertainty, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Bib Luck, K.S.; Pajarinen, J.; Berger, E.; Kyrki, V.; Ben Amor, H. (2016). Sparse Latent Space Policy Search, Proceedings of the Thirtieth AAAI Conference On Artificial Intelligence (AAAI). Bib Ikkala, A.; Pajarinen, J.; Kyrki, V. (2016). Benchmarking RGB-D Segmentation: Toy Dataset of Complex Crowded Scenes, Proceedings of the 11th International Conference on Computer Vision Theory and Applications (VISAPP). Bib Racca, M.; Pajarinen, J.; Montebelli, A.; Kyrki, V. (2016). Learning In-Contact Control Strategies from Demonstration, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Bib Pajarinen, J.; Kyrki, V. (2015). Decision Making Under Uncertain Segmentations, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). Bib Pajarinen, J.; Kyrki, V. (2014). Robotic manipulation in object composition space, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Bib Kondaxakis, P.; Pajarinen, J.; and Kyrki, V. (2014). Real-Time Recognition of Pointing Gestures for Robot to Robot Interaction, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Bib Pajarinen, J.; Peltonen, J. (2013). Expectation Maximization for Average Reward Decentralized POMDPs, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD). Bib Pajarinen, J.; Peltonen, J. (2011). Periodic Finite State Controllers for Efficient POMDP and DEC-POMDP Planning, Proceedings of the 25th Annual Conference on Neural Information Processing Systems (NIPS). Bib Pajarinen, J.; Peltonen, J. (2011). Efficient Planning for Factored Infinite-Horizon DEC-POMDPs, Proceedings of the 22nd International Joint Conference on Artificial Intelligence (IJCAI). Bib Pajarinen, J.; Peltonen, J.; Hottinen, A.; Uusitalo, M.A. (2010). Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD). Bib Pajarinen, J.; Peltonen, J.; Uusitalo, M.A.; Hottinen, A. (2009). Latent state models of primary user behavior for opportunistic spectrum access, Proceedings of the IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC). Bib Peltonen J.; Uusitalo, M.A.; Pajarinen, J. (2008). Nano-scale fault tolerant machine learning for cognitive radio, Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

Thesis
Bib Pajarinen, J. (2013). Planning under uncertainty for large-scale problems with applications to wireless networking.

Patents
Bib Uusitalo, M.A.; Hottinen, A.; Peltonen, J.; Pajarinen, J. (2014). Forecasting of dynamic environmental parameters to optimize operation of a wireless communication system. Bib Kuukankorpi, A.; Pajarinen, J.; Jalio, C.; Nippula, M. (2006). Processing of data packets within a network element cluster.