Publications - Chair of Cyber-Physical-Systems

Publication List with Images

2024

Nwankwo, Linus; Rueckert, Elmar

Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models Workshop

2024, ( In Workshop of the 2024 ACM/IEEE International Conference on HumanRobot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA).

Abstract | Links | BibTeX | Tags: Autonomous Navigation, Human-Robot Interaction, Large Language Models, mobile navigation

Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models

Nwankwo, Linus; Rueckert, Elmar

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article

In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction., pp. 808–812, ACM/IEEE Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232, (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339 ).

Abstract | Links | BibTeX | Tags: Autonomous Navigation, Large Language Models

@inproceedings{Nwankwo2024,

title = {The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language},

author = {Linus Nwankwo and Elmar Rueckert},

url = {https://doi.org/10.1145/3610978.3640723

https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs},

doi = {10.1145/3610978.3640723},

isbn = {9798400703232},

year  = {2024},

date = {2024-01-16},

urldate = {2024-01-16},

booktitle = {HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction.},

pages = {808–812},

publisher = {Association for Computing Machinery},

address = {New York, NY, USA},

organization = {ACM/IEEE},

series = {HRI '24},

abstract = {In recent years, autonomous agents have surged in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode the high-level natural language instructions from humans and abstract them into precise robot actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results with 99.13% command recognition accuracy and 97.96% commands execution success show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository.},

howpublished = {ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Companion)},

key = {ChatGPT, LLMs, ROS, VLMs, autonomous robots, human-robot interaction, natural language interaction},

note = {Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339 },

keywords = {Autonomous Navigation, Large Language Models},

pubstate = {published},

tppubtype = {inproceedings}

}

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language

Compact List without Images

Proceedings Articles

Nwankwo, Linus; Rueckert, Elmar

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article

Abstract | Links | BibTeX

@inproceedings{Nwankwo2024,

title = {The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language},

author = {Linus Nwankwo and Elmar Rueckert},

url = {https://doi.org/10.1145/3610978.3640723

https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs},

doi = {10.1145/3610978.3640723},

isbn = {9798400703232},

year  = {2024},

date = {2024-01-16},

urldate = {2024-01-16},

booktitle = {HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction.},

pages = {808–812},

publisher = {Association for Computing Machinery},

address = {New York, NY, USA},

organization = {ACM/IEEE},

series = {HRI '24},

abstract = {In recent years, autonomous agents have surged in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode the high-level natural language instructions from humans and abstract them into precise robot actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results with 99.13% command recognition accuracy and 97.96% commands execution success show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository.},

howpublished = {ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Companion)},

key = {ChatGPT, LLMs, ROS, VLMs, autonomous robots, human-robot interaction, natural language interaction},

note = {Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339 },

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Workshops

Nwankwo, Linus; Rueckert, Elmar

Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models Workshop

2024, ( In Workshop of the 2024 ACM/IEEE International Conference on HumanRobot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA).

Abstract | Links | BibTeX