Publication List with Images
2025
Dave, Vedant; Rueckert, Elmar: Skill Disentanglement in Reproducing Kernel Hilbert Space. Proceedings Article. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 16153-16162, 2025.
Tags: Deep Learning, neural network, Reinforcement Learning, Skill Discovery, Unsupervised Learning
Abstract: Unsupervised Skill Discovery aims at learning diverse skills without any extrinsic rewards and leveraging them as priors for learning a variety of downstream tasks. Existing approaches to unsupervised reinforcement learning typically involve discovering skills through empowerment-driven techniques or by maximizing entropy to encourage exploration. However, this mutual information objective often results in either static skills that discourage exploration or maximal coverage at the expense of non-discriminable skills. Instead of focusing only on maximizing bounds on f-divergence, we combine it with Integral Probability Metrics to maximize the distance between distributions to promote behavioural diversity and enforce disentanglement. Our method, Hilbert Unsupervised Skill Discovery (HUSD), provides an additional objective that seeks to obtain exploration and separability of state-skill pairs by maximizing the Maximum Mean Discrepancy between the joint distribution of skills and states and the product of their marginals in Reproducing Kernel Hilbert Space. Our results on the Unsupervised RL Benchmark show that HUSD outperforms previous exploration algorithms on state-based tasks.
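For readers unfamiliar with the objective named in the abstract, the following is a minimal sketch (not taken from the paper) of the general Maximum Mean Discrepancy between a joint distribution and the product of its marginals, written here for state-skill pairs (s, z) with a feature map phi into an RKHS H; the specific kernels and estimators used by HUSD are detailed in the paper itself.

$$\mathrm{MMD}^2\big(p(s,z),\; p(s)\,p(z)\big) \;=\; \big\lVert\, \mathbb{E}_{(s,z)\sim p(s,z)}[\phi(s,z)] \;-\; \mathbb{E}_{s\sim p(s),\, z\sim p(z)}[\phi(s,z)] \,\big\rVert_{\mathcal{H}}^{2}$$

Maximizing this distance pushes the joint distribution of states and skills away from independence, i.e., it makes skills statistically distinguishable through the states they induce.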
2020
Tanneberg, Daniel; Rueckert, Elmar; Peters, Jan: Evolutionary training and abstraction yields algorithmic generalization of neural computers. Journal Article. In: Nature Machine Intelligence, pp. 1–11, 2020.
Tags: neural network, Reinforcement Learning, Transfer Learning
2019
Tanneberg, Daniel; Peters, Jan; Rueckert, Elmar: Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks. Journal Article. In: Neural Networks – Elsevier, vol. 109, pp. 67-80, 2019, ISSN: 0893-6080 (Impact Factor of 7.197 (2017)).
Tags: neural network, Probabilistic Inference, RNN, spiking
2018
Gondaliya, Kaushikkumar D.; Peters, Jan; Rueckert, Elmar: Learning to Categorize Bug Reports with LSTM Networks. Proceedings Article. In: Proceedings of the International Conference on Advances in System Testing and Validation Lifecycle (VALID), pp. 6, XPS (Xpert Publishing Services), Nice, France, 2018, ISBN: 978-1-61208-671-2 (October 14-18, 2018).
Tags: Natural Language Processing, neural network, RNN
2015
Calandra, Roberto; Ivaldi, Serena; Deisenroth, Marc; Rueckert, Elmar; Peters, Jan: Learning Inverse Dynamics Models with Contacts. Proceedings Article. In: Proceedings of the International Conference on Robotics and Automation (ICRA), 2015.
Tags: inverse dynamics, model learning, neural network
Compact List without Images
Journal Articles
Tanneberg, Daniel; Rueckert, Elmar; Peters, Jan: Evolutionary training and abstraction yields algorithmic generalization of neural computers. Journal Article. In: Nature Machine Intelligence, pp. 1–11, 2020.
Tanneberg, Daniel; Peters, Jan; Rueckert, Elmar: Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks. Journal Article. In: Neural Networks – Elsevier, vol. 109, pp. 67-80, 2019, ISSN: 0893-6080 (Impact Factor of 7.197 (2017)).
Proceedings Articles
Dave, Vedant; Rueckert, Elmar: Skill Disentanglement in Reproducing Kernel Hilbert Space. Proceedings Article. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pp. 16153-16162, 2025.
Gondaliya, Kaushikkumar D.; Peters, Jan; Rueckert, Elmar: Learning to Categorize Bug Reports with LSTM Networks. Proceedings Article. In: Proceedings of the International Conference on Advances in System Testing and Validation Lifecycle (VALID), pp. 6, XPS (Xpert Publishing Services), Nice, France, 2018, ISBN: 978-1-61208-671-2 (October 14-18, 2018).
Calandra, Roberto; Ivaldi, Serena; Deisenroth, Marc; Rueckert, Elmar; Peters, Jan: Learning Inverse Dynamics Models with Contacts. Proceedings Article. In: Proceedings of the International Conference on Robotics and Automation (ICRA), 2015.