Publications | ROB.AI-Lab.Science | Chair of Cyber-Physical-Systems

Publication List with Images

2025
Nwankwo, Linus; Ellensohn, Bjoern; Dave, Vedant; Hofer, Peter; Forstner, Jan; Villneuve, Marlene; Galler, Robert; Rueckert, Elmar

EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Proceedings Article

In: IEEE International Conference on Robotics and Automation (ICRA 2025), 2025.

Tags: Autonomous Navigation, robotics, SLAM

Links: https://cloud.cps.unileoben.ac.at/index.php/s/MawgtYSbTBoNBZo

BibTeX:

@inproceedings{Nwankwo2025,
  title = {EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments},
  author = {Linus Nwankwo and Bjoern Ellensohn and Vedant Dave and Peter Hofer and Jan Forstner and Marlene Villneuve and Robert Galler and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/MawgtYSbTBoNBZo},
  year = {2025},
  date = {2025-01-27},
  urldate = {2025-01-27},
  booktitle = {IEEE International Conference on Robotics and Automation (ICRA 2025).},
  keywords = {Autonomous Navigation, robotics, SLAM},
  pubstate = {published},
  tppubtype = {inproceedings}
}

2024
Nwankwo, Linus; Rueckert, Elmar

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article

In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pp. 808–812, ACM/IEEE, Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232 (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339).

Tags: Autonomous Navigation, Large Language Models

Links: https://doi.org/10.1145/3610978.3640723, https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs, doi:10.1145/3610978.3640723

BibTeX:

@inproceedings{Nwankwo2024,
  title = {The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language},
  author = {Linus Nwankwo and Elmar Rueckert},
  url = {https://doi.org/10.1145/3610978.3640723 https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs},
  doi = {10.1145/3610978.3640723},
  isbn = {9798400703232},
  year = {2024},
  date = {2024-01-16},
  urldate = {2024-01-16},
  booktitle = {HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction.},
  pages = {808–812},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  organization = {ACM/IEEE},
  series = {HRI '24},
  abstract = {In recent years, autonomous agents have surged in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode the high-level natural language instructions from humans and abstract them into precise robot actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results with 99.13% command recognition accuracy and 97.96% commands execution success show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository.},
  howpublished = {ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Companion)},
  key = {ChatGPT, LLMs, ROS, VLMs, autonomous robots, human-robot interaction, natural language interaction},
  note = {Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339 },
  keywords = {Autonomous Navigation, Large Language Models},
  pubstate = {published},
  tppubtype = {inproceedings}
}

2023
Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Ngoc Thinh

Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot Proceedings Article

In: International Conference on System Theory, Control and Computing (ICSTCC), 2023 (October 11-13, 2023, Timisoara, Romania).

Tags: Autonomous Navigation, Deep Learning, Reinforcement Learning

Links: https://cloud.cps.unileoben.ac.at/index.php/s/zEnY3yoFHZRdzkR

BibTeX:

@inproceedings{Yadav2023b,
  title = {Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot},
  author = {Harsh Yadav and Honghu Xue and Yan Rudall and Mohamed Bakr and Benedikt Hein and Elmar Rueckert and Ngoc Thinh Nguyen},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/zEnY3yoFHZRdzkR},
  year = {2023},
  date = {2023-06-26},
  urldate = {2023-06-26},
  publisher = {International Conference on System Theory, Control and Computing (ICSTCC)},
  note = {October 11-13, 2023, Timisoara, Romania.},
  keywords = {Autonomous Navigation, Deep Learning, Reinforcement Learning},
  pubstate = {published},
  tppubtype = {inproceedings}
}

2022
Xue, Honghu; Song, Rui; Petzold, Julian; Hein, Benedikt; Hamann, Heiko; Rueckert, Elmar

End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments Proceedings Article

In: International Conference on Humanoid Robots (Humanoids 2022), 2022.

Tags: Autonomous Navigation, Deep Learning, mobile navigation

Links: https://cloud.cps.unileoben.ac.at/index.php/s/RzMQWqsFarQ6Kw4

BibTeX:

@inproceedings{Xue2022b,
  title = {End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments},
  author = {Honghu Xue and Rui Song and Julian Petzold and Benedikt Hein and Heiko Hamann and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/RzMQWqsFarQ6Kw4},
  year = {2022},
  date = {2022-09-26},
  urldate = {2022-09-26},
  publisher = {International Conference on Humanoid Robots (Humanoids 2022)},
  abstract = {We solve a visual navigation problem in an urban setting via deep reinforcement learning in an end-to-end manner. A major challenge of a first-person visual navigation problem lies in severe partial observability and sparse positive experiences of reaching the goal. To address partial observability, we propose a novel 3D-temporal convolutional network to encode sequential historical visual observations, its effectiveness is verified by comparing to a commonly-used frame-stacking approach. For sparse positive samples, we propose an improved automatic curriculum learning algorithm NavACL+, which proposes meaningful curricula starting from easy tasks and gradually generalizes to challenging ones. NavACL+ is shown to facilitate the learning process, greatly improve the task success rate on difficult tasks by at least 40% and offer enhanced generalization to different initial poses compared to training from a fixed initial pose and the original NavACL algorithm.},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation},
  pubstate = {published},
  tppubtype = {inproceedings}
}

Compact List without Images

Proceedings Articles

Nwankwo, Linus; Ellensohn, Bjoern; Dave, Vedant; Hofer, Peter; Forstner, Jan; Villneuve, Marlene; Galler, Robert; Rueckert, Elmar

EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Proceedings Article

In: IEEE International Conference on Robotics and Automation (ICRA 2025), 2025.

Nwankwo, Linus; Rueckert, Elmar

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article

In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pp. 808–812, ACM/IEEE, Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232 (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339).

Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Ngoc Thinh

Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot Proceedings Article

In: International Conference on System Theory, Control and Computing (ICSTCC), 2023 (October 11-13, 2023, Timisoara, Romania).

Xue, Honghu; Song, Rui; Petzold, Julian; Hein, Benedikt; Hamann, Heiko; Rueckert, Elmar

End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments Proceedings Article

In: International Conference on Humanoid Robots (Humanoids 2022), 2022.
