Publications | ROB.AI-Lab.Science | Chair of Cyber-Physical-Systems

Publication List with Images

2025
Nwankwo, Linus; Ellensohn, Bjoern; Dave, Vedant; Hofer, Peter; Forstner, Jan; Villneuve, Marlene; Galler, Robert; Rueckert, Elmar
EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Proceedings Article
In: IEEE International Conference on Robotics and Automation (ICRA 2025), 2025.
Tags: Autonomous Navigation, robotics, SLAM
@inproceedings{Nwankwo2025,
  title = {EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments},
  author = {Linus Nwankwo and Bjoern Ellensohn and Vedant Dave and Peter Hofer and Jan Forstner and Marlene Villneuve and Robert Galler and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/MawgtYSbTBoNBZo},
  year = {2025},
  date = {2025-01-27},
  urldate = {2025-01-27},
  booktitle = {IEEE International Conference on Robotics and Automation (ICRA 2025)},
  keywords = {Autonomous Navigation, robotics, SLAM},
  pubstate = {published},
  tppubtype = {inproceedings}
}
2024
Nwankwo, Linus; Rueckert, Elmar
Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models Workshop
2024, (In Workshop of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA).
Tags: Autonomous Navigation, Human-Robot Interaction, Large Language Models, mobile navigation
@workshop{Nwankwo2024MultimodalHA,
  title = {Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models},
  author = {Linus Nwankwo and Elmar Rueckert},
  url = {https://human-llm-interaction.github.io/workshop/hri24/papers/hllmi24_paper_5.pdf},
  year = {2024},
  date = {2024-03-11},
  urldate = {2024-03-11},
  abstract = {In this paper, we extended the method proposed in [17] to enable humans to interact naturally with autonomous agents through vocal and textual conversations. Our extended method exploits the inherent capabilities of pre-trained large language models (LLMs), multimodal visual language models (VLMs), and speech recognition (SR) models to decode the high-level natural language conversations and semantic understanding of the robot's task environment, and abstract them to the robot's actionable commands or queries. We performed a quantitative evaluation of our framework's natural vocal conversation understanding with participants from different racial backgrounds and English language accents. The participants interacted with the robot using both vocal and textual instructional commands. Based on the logged interaction data, our framework achieved 87.55% vocal commands decoding accuracy, 86.27% commands execution success, and an average latency of 0.89 seconds from receiving the participants' vocal chat commands to initiating the robot’s actual physical action. The video demonstrations of this paper can be found at https://linusnep.github.io/MTCC-IRoNL/},
  note = {In Workshop of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA},
  keywords = {Autonomous Navigation, Human-Robot Interaction, Large Language Models, mobile navigation},
  pubstate = {published},
  tppubtype = {workshop}
}
Nwankwo, Linus; Rueckert, Elmar
The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article
In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pp. 808–812, ACM/IEEE Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232, (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339).
Tags: Autonomous Navigation, Large Language Models
@inproceedings{Nwankwo2024,
  title = {The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language},
  author = {Linus Nwankwo and Elmar Rueckert},
  url = {https://doi.org/10.1145/3610978.3640723 https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs},
  doi = {10.1145/3610978.3640723},
  isbn = {9798400703232},
  year = {2024},
  date = {2024-01-16},
  urldate = {2024-01-16},
  booktitle = {HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction},
  pages = {808--812},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  organization = {ACM/IEEE},
  series = {HRI '24},
  abstract = {In recent years, autonomous agents have surged in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode the high-level natural language instructions from humans and abstract them into precise robot actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results with 99.13% command recognition accuracy and 97.96% commands execution success show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository.},
  howpublished = {ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Companion)},
  key = {ChatGPT, LLMs, ROS, VLMs, autonomous robots, human-robot interaction, natural language interaction},
  note = {Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339},
  keywords = {Autonomous Navigation, Large Language Models},
  pubstate = {published},
  tppubtype = {inproceedings}
}
2023
Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Ngoc Thinh
Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot Proceedings Article
In: International Conference on System Theory, Control and Computing (ICSTCC), 2023, (October 11-13, 2023, Timisoara, Romania).
Tags: Autonomous Navigation, Deep Learning, Reinforcement Learning
@inproceedings{Yadav2023b,
  title = {Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot},
  author = {Harsh Yadav and Honghu Xue and Yan Rudall and Mohamed Bakr and Benedikt Hein and Elmar Rueckert and Ngoc Thinh Nguyen},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/zEnY3yoFHZRdzkR},
  year = {2023},
  date = {2023-06-26},
  urldate = {2023-06-26},
  publisher = {International Conference on System Theory, Control and Computing (ICSTCC)},
  note = {October 11-13, 2023, Timisoara, Romania.},
  keywords = {Autonomous Navigation, Deep Learning, Reinforcement Learning},
  pubstate = {published},
  tppubtype = {inproceedings}
}
Nwankwo, Linus; Fritze, Clemens; Bartsch, Konrad; Rueckert, Elmar
ROMR: A ROS-based Open-source Mobile Robot Journal Article
In: HardwareX, vol. 15, pp. 1–29, 2023.
Tags: Autonomous Navigation, mobile navigation, SLAM
@article{Nwankwo2023b,
  title = {ROMR: A ROS-based Open-source Mobile Robot},
  author = {Linus Nwankwo and Clemens Fritze and Konrad Bartsch and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/8aXLXXPFAZ4wq54},
  doi = {10.1016/j.ohx.2023.e00426},
  year = {2023},
  date = {2023-04-17},
  urldate = {2023-04-17},
  journal = {HardwareX},
  volume = {15},
  pages = {1--29},
  abstract = {Currently, commercially available intelligent transport robots that are capable of carrying up to 90kg of load can cost $5,000 or even more. This makes real-world experimentation prohibitively expensive and limits the applicability of such systems to everyday home or industrial tasks. Aside from their high cost, the majority of commercially available platforms are either closed-source, platform-specific, or use difficult-to-customize hardware and firmware. In this work, we present a low-cost, open-source and modular alternative, referred to herein as “ROS-based open-source mobile robot (ROMR)”. ROMR utilizes off-the-shelf (OTS) components, additive manufacturing technologies, aluminium profiles, and a consumer hoverboard with high-torque brushless direct current (BLDC) motors. ROMR is fully compatible with the robot operating system (ROS), has a maximum payload of 90kg, and costs less than $1500. Furthermore, ROMR offers a simple yet robust framework for contextualizing simultaneous localization and mapping (SLAM) algorithms, an essential prerequisite for autonomous robot navigation. The robustness and performance of the ROMR were validated through real-world and simulation experiments. All the design, construction and software files are freely available online under the GNU GPL v3 license at https://doi.org/10.17605/OSF.IO/K83X7. A descriptive video of ROMR can be found at https://osf.io/ku8ag.},
  keywords = {Autonomous Navigation, mobile navigation, SLAM},
  pubstate = {published},
  tppubtype = {article}
}
Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Thinh
Deep Reinforcement Learning for Autonomous Navigation in Intralogistics Workshop
2023, (European Control Conference (ECC) Workshop, Extended Abstract).
Tags: Autonomous Navigation, Deep Learning, mobile navigation, SLAM
@workshop{Yadav2023,
  title = {Deep Reinforcement Learning for Autonomous Navigation in Intralogistics},
  author = {Harsh Yadav and Honghu Xue and Yan Rudall and Mohamed Bakr and Benedikt Hein and Elmar Rueckert and Thinh Nguyen},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/tw4D43WTzG6yLmE},
  year = {2023},
  date = {2023-03-10},
  urldate = {2023-03-10},
  abstract = {Even with several advances in autonomous mobile robots, navigation in a highly dynamic environment still remains a challenge. Classical navigation systems, such as Simultaneous Localization and Mapping (SLAM), build a map of the environment and constructing maps of highly dynamic environments is impractical. Deep Reinforcement Learning (DRL) approaches have the ability to learn policies without knowledge of the maps or the transition models of the environment. The aim of our work is to investigate the potential of using DRL to control an autonomous mobile robot to dock with a load carrier. This paper presents an initial successful training result of the Soft Actor-Critic (SAC) algorithm, which can navigate a robot toward an open door only based on the 360° LiDAR observations. Ongoing work is using visual sensors for load carrier docking.},
  howpublished = {European Control Conference (ECC)},
  note = {European Control Conference (ECC) Workshop, Extended Abstract.},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation, SLAM},
  pubstate = {published},
  tppubtype = {workshop}
}
2022
Xue, Honghu; Song, Rui; Petzold, Julian; Hein, Benedikt; Hamann, Heiko; Rueckert, Elmar
End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments Proceedings Article
In: International Conference on Humanoid Robots (Humanoids 2022), 2022.
Tags: Autonomous Navigation, Deep Learning, mobile navigation
@inproceedings{Xue2022b,
  title = {End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments},
  author = {Honghu Xue and Rui Song and Julian Petzold and Benedikt Hein and Heiko Hamann and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/RzMQWqsFarQ6Kw4},
  year = {2022},
  date = {2022-09-26},
  urldate = {2022-09-26},
  publisher = {International Conference on Humanoid Robots (Humanoids 2022)},
  abstract = {We solve a visual navigation problem in an urban setting via deep reinforcement learning in an end-to-end manner. A major challenge of a first-person visual navigation problem lies in severe partial observability and sparse positive experiences of reaching the goal. To address partial observability, we propose a novel 3D-temporal convolutional network to encode sequential historical visual observations; its effectiveness is verified by comparing to a commonly-used frame-stacking approach. For sparse positive samples, we propose an improved automatic curriculum learning algorithm NavACL+, which proposes meaningful curricula starting from easy tasks and gradually generalizes to challenging ones. NavACL+ is shown to facilitate the learning process, greatly improve the task success rate on difficult tasks by at least 40% and offer enhanced generalization to different initial poses compared to training from a fixed initial pose and the original NavACL algorithm.},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation},
  pubstate = {published},
  tppubtype = {inproceedings}
}
Rottmann, Nils; Studt, Nico; Ernst, Floris; Rueckert, Elmar
ROS-Mobile: An Android™ application for the Robot Operating System Journal Article
In: arXiv, 2022.
Tags: Autonomous Navigation, mobile navigation, Simulation
@article{Rottmann2022,
  title = {ROS-Mobile: An Android™ application for the Robot Operating System},
  author = {Nils Rottmann and Nico Studt and Floris Ernst and Elmar Rueckert},
  url = {https://arxiv.org/pdf/2011.02781.pdf},
  doi = {10.48550/arXiv.2011.02781},
  year = {2022},
  date = {2022-09-01},
  urldate = {2022-09-01},
  journal = {arXiv},
  keywords = {Autonomous Navigation, mobile navigation, Simulation},
  pubstate = {published},
  tppubtype = {article}
}
Xue, Honghu; Hein, Benedikt; Bakr, Mohamed; Schildbach, Georg; Abel, Bengt; Rueckert, Elmar
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics Journal Article
In: Applied Sciences (MDPI), Special Issue on Intelligent Robotics, 2022, (Supplement: https://cloud.cps.unileoben.ac.at/index.php/s/Sj68rQewnkf4ppZ).
Tags: Autonomous Navigation, Deep Learning, mobile navigation, Reinforcement Learning
@article{Xue2022,
  title = {Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics},
  author = {Honghu Xue and Benedikt Hein and Mohamed Bakr and Georg Schildbach and Bengt Abel and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/yddDZ7z9oqxenCi},
  year = {2022},
  date = {2022-01-31},
  urldate = {2022-01-31},
  journal = {Applied Sciences (MDPI), Special Issue on Intelligent Robotics},
  abstract = {We propose a deep reinforcement learning approach for solving a mapless navigation problem in warehouse scenarios. The automatic guided vehicle is equipped with LiDAR and frontal RGB sensors and learns to reach underneath the target dolly. The challenges reside in the sparseness of positive samples for learning, multi-modal sensor perception with partial observability, the demand for accurate steering maneuvers together with long training cycles. To address these points, we proposed NavACL-Q as an automatic curriculum learning together with distributed soft actor-critic. The performance of the learning algorithm is evaluated exhaustively in a different warehouse environment to check both robustness and generalizability of the learned policy. Results in NVIDIA Isaac Sim demonstrate that our trained agent significantly outperforms the map-based navigation pipeline provided by NVIDIA Isaac Sim in terms of higher agent-goal distances and relative orientations. The ablation studies also confirmed that NavACL-Q greatly facilitates the whole learning process and a pre-trained feature extractor manifestly boosts the training speed.},
  note = {Supplement: https://cloud.cps.unileoben.ac.at/index.php/s/Sj68rQewnkf4ppZ},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation, Reinforcement Learning},
  pubstate = {published},
  tppubtype = {article}
}

Compact List without Images

Journal Articles

Nwankwo, Linus; Fritze, Clemens; Bartsch, Konrad; Rueckert, Elmar

ROMR: A ROS-based Open-source Mobile Robot Journal Article

In: HardwareX, vol. 15, pp. 1–29, 2023.

Rottmann, Nils; Studt, Nico; Ernst, Floris; Rueckert, Elmar

ROS-Mobile: An Android™ application for the Robot Operating System Journal Article

In: arXiv, 2022.

Xue, Honghu; Hein, Benedikt; Bakr, Mohamed; Schildbach, Georg; Abel, Bengt; Rueckert, Elmar

Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics Journal Article

In: Applied Sciences (MDPI), Special Issue on Intelligent Robotics, 2022, (Supplement: https://cloud.cps.unileoben.ac.at/index.php/s/Sj68rQewnkf4ppZ).

Proceedings Articles

Nwankwo, Linus; Ellensohn, Bjoern; Dave, Vedant; Hofer, Peter; Forstner, Jan; Villneuve, Marlene; Galler, Robert; Rueckert, Elmar

EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Proceedings Article

In: IEEE International Conference on Robotics and Automation (ICRA 2025), 2025.

Nwankwo, Linus; Rueckert, Elmar

The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article

In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pp. 808–812, ACM/IEEE Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232, (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339).

Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Ngoc Thinh

Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot Proceedings Article

In: International Conference on System Theory, Control and Computing (ICSTCC), 2023, (October 11-13, 2023, Timisoara, Romania).

Xue, Honghu; Song, Rui; Petzold, Julian; Hein, Benedikt; Hamann, Heiko; Rueckert, Elmar

End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments Proceedings Article

In: International Conference on Humanoid Robots (Humanoids 2022), 2022.

Workshops

Nwankwo, Linus; Rueckert, Elmar

Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models Workshop

2024, (In Workshop of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA).

Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Thinh

Deep Reinforcement Learning for Autonomous Navigation in Intralogistics Workshop

2023, (European Control Conference (ECC) Workshop, Extended Abstract.).

2025
Nwankwo, Linus; Ellensohn, Bjoern; Dave, Vedant; Hofer, Peter; Forstner, Jan; Villneuve, Marlene; Galler, Robert; Rueckert, Elmar EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Proceedings Article In: IEEE International Conference on Robotics and Automation (ICRA 2025)., 2025. Links \| BibTeX \| Tags: Autonomous Navigation, robotics, SLAM @inproceedings{Nwankwo2025, title = {EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments}, author = {Linus Nwankwo and Bjoern Ellensohn and Vedant Dave and Peter Hofer and Jan Forstner and Marlene Villneuve and Robert Galler and Elmar Rueckert}, url = {https://cloud.cps.unileoben.ac.at/index.php/s/MawgtYSbTBoNBZo}, year = {2025}, date = {2025-01-27}, urldate = {2025-01-27}, booktitle = {IEEE International Conference on Robotics and Automation (ICRA 2025).}, keywords = {Autonomous Navigation, robotics, SLAM}, pubstate = {published}, tppubtype = {inproceedings} } Close https://cloud.cps.unileoben.ac.at/index.php/s/MawgtYSbTBoNBZo Close
2024
Nwankwo, Linus; Rueckert, Elmar Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models Workshop 2024, ( In Workshop of the 2024 ACM/IEEE International Conference on HumanRobot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA). Abstract \| Links \| BibTeX \| Tags: Autonomous Navigation, Human-Robot Interaction, Large Language Models, mobile navigation @workshop{Nwankwo2024MultimodalHA, title = {Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation Models}, author = {Linus Nwankwo and Elmar Rueckert}, url = {https://human-llm-interaction.github.io/workshop/hri24/papers/hllmi24_paper_5.pdf}, year = {2024}, date = {2024-03-11}, urldate = {2024-03-11}, abstract = {In this paper, we extended the method proposed in [17] to enable humans to interact naturally with autonomous agents through vocal and textual conversations. Our extended method exploits the inherent capabilities of pre-trained large language models (LLMs), multimodal visual language models (VLMs), and speech recognition (SR) models to decode the high-level natural language conversations and semantic understanding of the robot's task environment, and abstract them to the robot's actionable commands or queries. We performed a quantitative evaluation of our framework's natural vocal conversation understanding with participants from different racial backgrounds and English language accents. The participants interacted with the robot using both vocal and textual instructional commands. Based on the logged interaction data, our framework achieved 87.55% vocal commands decoding accuracy, 86.27% commands execution success, and an average latency of 0.89 seconds from receiving the participants' vocal chat commands to initiating the robot’s actual physical action. 
The video demonstrations of this paper can be found at https://linusnep.github.io/MTCC-IRoNL/}, note = { In Workshop of the 2024 ACM/IEEE International Conference on HumanRobot Interaction (HRI ’24 Workshop), March 11–14, 2024, Boulder, CO, USA. ACM, New York, NY, USA}, keywords = {Autonomous Navigation, Human-Robot Interaction, Large Language Models, mobile navigation}, pubstate = {published}, tppubtype = {workshop} } Close In this paper, we extended the method proposed in [17] to enable humans to interact naturally with autonomous agents through vocal and textual conversations. Our extended method exploits the inherent capabilities of pre-trained large language models (LLMs), multimodal visual language models (VLMs), and speech recognition (SR) models to decode the high-level natural language conversations and semantic understanding of the robot's task environment, and abstract them to the robot's actionable commands or queries. We performed a quantitative evaluation of our framework's natural vocal conversation understanding with participants from different racial backgrounds and English language accents. The participants interacted with the robot using both vocal and textual instructional commands. Based on the logged interaction data, our framework achieved 87.55% vocal commands decoding accuracy, 86.27% commands execution success, and an average latency of 0.89 seconds from receiving the participants' vocal chat commands to initiating the robot’s actual physical action. The video demonstrations of this paper can be found at https://linusnep.github.io/MTCC-IRoNL/ Close https://human-llm-interaction.github.io/workshop/hri24/papers/hllmi24_paper_5.pd[…] Close
Nwankwo, Linus; Rueckert, Elmar The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language Proceedings Article In: HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, pp. 808–812, ACM/IEEE Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 9798400703232, (Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339).
Tags: Autonomous Navigation, Large Language Models
@inproceedings{Nwankwo2024,
  title = {The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language},
  author = {Linus Nwankwo and Elmar Rueckert},
  url = {https://doi.org/10.1145/3610978.3640723 https://cloud.cps.unileoben.ac.at/index.php/s/YzJdHWDt9ZdqsZs},
  doi = {10.1145/3610978.3640723},
  isbn = {9798400703232},
  year = {2024},
  date = {2024-01-16},
  urldate = {2024-01-16},
  booktitle = {HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction},
  pages = {808--812},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  organization = {ACM/IEEE},
  series = {HRI '24},
  abstract = {In recent years, autonomous agents have surged in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode the high-level natural language instructions from humans and abstract them into precise robot actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results with 99.13% command recognition accuracy and 97.96% commands execution success show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository.},
  howpublished = {ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24 Companion)},
  key = {ChatGPT, LLMs, ROS, VLMs, autonomous robots, human-robot interaction, natural language interaction},
  note = {Published as late breaking results. Supplementary video: https://cloud.cps.unileoben.ac.at/index.php/s/fRE9XMosWDtJ339},
  keywords = {Autonomous Navigation, Large Language Models},
  pubstate = {published},
  tppubtype = {inproceedings}
}
2023
Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Ngoc Thinh Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot Proceedings Article In: International Conference on System Theory, Control and Computing (ICSTCC), 2023, (October 11-13, 2023, Timisoara, Romania.).
Tags: Autonomous Navigation, Deep Learning, Reinforcement Learning
@inproceedings{Yadav2023b,
  title = {Deep Reinforcement Learning for Mapless Navigation of Autonomous Mobile Robot},
  author = {Harsh Yadav and Honghu Xue and Yan Rudall and Mohamed Bakr and Benedikt Hein and Elmar Rueckert and Ngoc Thinh Nguyen},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/zEnY3yoFHZRdzkR},
  year = {2023},
  date = {2023-06-26},
  urldate = {2023-06-26},
  publisher = {International Conference on System Theory, Control and Computing (ICSTCC)},
  note = {October 11-13, 2023, Timisoara, Romania.},
  keywords = {Autonomous Navigation, Deep Learning, Reinforcement Learning},
  pubstate = {published},
  tppubtype = {inproceedings}
}
Nwankwo, Linus; Fritze, Clemens; Bartsch, Konrad; Rueckert, Elmar ROMR: A ROS-based Open-source Mobile Robot Journal Article In: HardwareX, vol. 15, pp. 1–29, 2023.
Tags: Autonomous Navigation, mobile navigation, SLAM
@article{Nwankwo2023b,
  title = {ROMR: A ROS-based Open-source Mobile Robot},
  author = {Linus Nwankwo and Clemens Fritze and Konrad Bartsch and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/8aXLXXPFAZ4wq54},
  doi = {10.1016/j.ohx.2023.e00426},
  year = {2023},
  date = {2023-04-17},
  urldate = {2023-04-17},
  journal = {HardwareX},
  volume = {15},
  pages = {1--29},
  abstract = {Currently, commercially available intelligent transport robots that are capable of carrying up to 90kg of load can cost $5,000 or even more. This makes real-world experimentation prohibitively expensive and limits the applicability of such systems to everyday home or industrial tasks. Aside from their high cost, the majority of commercially available platforms are either closed-source, platform-specific, or use difficult-to-customize hardware and firmware. In this work, we present a low-cost, open-source and modular alternative, referred to herein as “ROS-based open-source mobile robot (ROMR)”. ROMR utilizes off-the-shelf (OTS) components, additive manufacturing technologies, aluminium profiles, and a consumer hoverboard with high-torque brushless direct current (BLDC) motors. ROMR is fully compatible with the robot operating system (ROS), has a maximum payload of 90kg, and costs less than $1500. Furthermore, ROMR offers a simple yet robust framework for contextualizing simultaneous localization and mapping (SLAM) algorithms, an essential prerequisite for autonomous robot navigation. The robustness and performance of the ROMR were validated through real-world and simulation experiments. All the design, construction and software files are freely available online under the GNU GPL v3 license at https://doi.org/10.17605/OSF.IO/K83X7. A descriptive video of ROMR can be found at https://osf.io/ku8ag.},
  keywords = {Autonomous Navigation, mobile navigation, SLAM},
  pubstate = {published},
  tppubtype = {article}
}
Yadav, Harsh; Xue, Honghu; Rudall, Yan; Bakr, Mohamed; Hein, Benedikt; Rueckert, Elmar; Nguyen, Thinh Deep Reinforcement Learning for Autonomous Navigation in Intralogistics Workshop 2023, (European Control Conference (ECC) Workshop, Extended Abstract.).
Tags: Autonomous Navigation, Deep Learning, mobile navigation, SLAM
@workshop{Yadav2023,
  title = {Deep Reinforcement Learning for Autonomous Navigation in Intralogistics},
  author = {Harsh Yadav and Honghu Xue and Yan Rudall and Mohamed Bakr and Benedikt Hein and Elmar Rueckert and Thinh Nguyen},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/tw4D43WTzG6yLmE},
  year = {2023},
  date = {2023-03-10},
  urldate = {2023-03-10},
  abstract = {Even with several advances in autonomous mobile robots, navigation in a highly dynamic environment still remains a challenge. Classical navigation systems, such as Simultaneous Localization and Mapping (SLAM), build a map of the environment, and constructing maps of highly dynamic environments is impractical. Deep Reinforcement Learning (DRL) approaches have the ability to learn policies without knowledge of the maps or the transition models of the environment. The aim of our work is to investigate the potential of using DRL to control an autonomous mobile robot to dock with a load carrier. This paper presents an initial successful training result of the Soft Actor-Critic (SAC) algorithm, which can navigate a robot toward an open door only based on the 360° LiDAR observations. Ongoing work is using visual sensors for load carrier docking.},
  howpublished = {European Control Conference (ECC)},
  note = {European Control Conference (ECC) Workshop, Extended Abstract.},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation, SLAM},
  pubstate = {published},
  tppubtype = {workshop}
}
2022
Xue, Honghu; Song, Rui; Petzold, Julian; Hein, Benedikt; Hamann, Heiko; Rueckert, Elmar End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments Proceedings Article In: International Conference on Humanoid Robots (Humanoids 2022), 2022.
Tags: Autonomous Navigation, Deep Learning, mobile navigation
@inproceedings{Xue2022b,
  title = {End-To-End Deep Reinforcement Learning for First-Person Pedestrian Visual Navigation in Urban Environments},
  author = {Honghu Xue and Rui Song and Julian Petzold and Benedikt Hein and Heiko Hamann and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/RzMQWqsFarQ6Kw4},
  year = {2022},
  date = {2022-09-26},
  urldate = {2022-09-26},
  publisher = {International Conference on Humanoid Robots (Humanoids 2022)},
  abstract = {We solve a visual navigation problem in an urban setting via deep reinforcement learning in an end-to-end manner. A major challenge of a first-person visual navigation problem lies in severe partial observability and sparse positive experiences of reaching the goal. To address partial observability, we propose a novel 3D-temporal convolutional network to encode sequential historical visual observations; its effectiveness is verified by comparing to a commonly-used frame-stacking approach. For sparse positive samples, we propose an improved automatic curriculum learning algorithm NavACL+, which proposes meaningful curricula starting from easy tasks and gradually generalizes to challenging ones. NavACL+ is shown to facilitate the learning process, greatly improve the task success rate on difficult tasks by at least 40% and offer enhanced generalization to different initial poses compared to training from a fixed initial pose and the original NavACL algorithm.},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation},
  pubstate = {published},
  tppubtype = {inproceedings}
}
Rottmann, Nils; Studt, Nico; Ernst, Floris; Rueckert, Elmar ROS-Mobile: An Android™ application for the Robot Operating System Journal Article In: arXiv, 2022.
Tags: Autonomous Navigation, mobile navigation, Simulation
@article{Rottmann2022,
  title = {ROS-Mobile: An Android™ application for the Robot Operating System},
  author = {Nils Rottmann and Nico Studt and Floris Ernst and Elmar Rueckert},
  url = {https://arxiv.org/pdf/2011.02781.pdf},
  doi = {10.48550/arXiv.2011.02781},
  year = {2022},
  date = {2022-09-01},
  urldate = {2022-09-01},
  journal = {arXiv},
  keywords = {Autonomous Navigation, mobile navigation, Simulation},
  pubstate = {published},
  tppubtype = {article}
}
Xue, Honghu; Hein, Benedikt; Bakr, Mohamed; Schildbach, Georg; Abel, Bengt; Rueckert, Elmar Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics Journal Article In: Applied Sciences (MDPI), Special Issue on Intelligent Robotics, 2022, (Supplement: https://cloud.cps.unileoben.ac.at/index.php/s/Sj68rQewnkf4ppZ).
Tags: Autonomous Navigation, Deep Learning, mobile navigation, Reinforcement Learning
@article{Xue2022,
  title = {Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics},
  author = {Honghu Xue and Benedikt Hein and Mohamed Bakr and Georg Schildbach and Bengt Abel and Elmar Rueckert},
  url = {https://cloud.cps.unileoben.ac.at/index.php/s/yddDZ7z9oqxenCi},
  year = {2022},
  date = {2022-01-31},
  urldate = {2022-01-31},
  journal = {Applied Sciences (MDPI), Special Issue on Intelligent Robotics},
  abstract = {We propose a deep reinforcement learning approach for solving a mapless navigation problem in warehouse scenarios. The automatic guided vehicle is equipped with LiDAR and frontal RGB sensors and learns to reach underneath the target dolly. The challenges reside in the sparseness of positive samples for learning, multi-modal sensor perception with partial observability, the demand for accurate steering maneuvers together with long training cycles. To address these points, we proposed NavACL-Q as an automatic curriculum learning together with distributed soft actor-critic. The performance of the learning algorithm is evaluated exhaustively in a different warehouse environment to check both robustness and generalizability of the learned policy. Results in NVIDIA Isaac Sim demonstrate that our trained agent significantly outperforms the map-based navigation pipeline provided by NVIDIA Isaac Sim in terms of higher agent-goal distances and relative orientations. The ablation studies also confirmed that NavACL-Q greatly facilitates the whole learning process and a pre-trained feature extractor manifestly boosts the training speed.},
  note = {Supplement: https://cloud.cps.unileoben.ac.at/index.php/s/Sj68rQewnkf4ppZ},
  keywords = {Autonomous Navigation, Deep Learning, mobile navigation, Reinforcement Learning},
  pubstate = {published},
  tppubtype = {article}
}