BSc. Thesis, Merisa Salkic – Smart conversations: Enhancing robotic task execution through advanced language models

Supervisor: Linus Nwankwo, M.Sc.;
Univ.-Prof. Dr Elmar Rückert
Start date:  As soon as possible

 

Theoretical difficulty: mid
Practical difficulty: High

Abstract

In this thesis, we aim to enhance the method proposed in [1] for robust natural human-autonomous agent interaction through verbal and textual conversations. 

The primary focus would be to develop a system that can enhance the natural language conversations, understand the 

semantic  context of the robot’s task environment, and abstract this information into actionable commands or queries. This will be achieved by leveraging the capabilities of pre-trained large language models (LLMs) – GPT-4, visual language models (VLMs) – CLIP, and audio language models (ALMs) – AudioLM.

Tentative Work Plan

To achieve the objectives, the following concrete tasks will be focused on:

  • Initialisation and Background:
    • Study the concept of LLMs, VLMs, and ALMs.
    • How LLMs, VLMs, and ALMs can be grounded for autonomous robotic tasks.
    • Familiarise yourself with the methods at the project website – https://linusnep.github.io/MTCC-IRoNL/.
    •  
  • Setup and Familiarity with the Simulation Environment
    • Build a robot model (URDF) for the simulation (optional if you wish to use the existing one).
    • Set up the ROS framework for the simulation (Gazebo, Rviz).
    • Recommended programming tools: C++, Python, Matlab.
    •  
  • Coding
    • Improve the existing code of the method proposed in [1] to incorporate the aforementioned modalities—the code to be provided to the student.
    • Integrate other LLMs e.g., LLaMA and VLMs e.g., GLIP modalities into the framework and compare their performance with the baseline (GPT-4 and CLIP).
    •  
  • Intermediate Presentation:
    • Present the results of your background study or what you must have done so far.
    • Detailed planning of the next steps.
    •  
  • Simulation & Real-World Testing (If Possible):
    • Test your implemented model with a Gazebo-simulated quadruped or differential drive robot.
    • Perform the real-world testing of the developed framework with our Unitree Go1 quadruped robot or with our Segway RMP 220 Lite robot.
    • Analyse and compare the model’s performance in real-world scenarios versus simulations with the different LLMs and VLMs pipelines.
    •  
  • Optimize the Framework for Optimal Performance and Efficiency (Optional):
    • Validate the model to identify bottlenecks within the robot’s task environment.
    •  
  • Documentation and Thesis Writing:
    • Document the entire process, methodologies, and tools used.
    • Analyse and interpret the results.
    • Draft the project report or thesis, ensuring that the primary objectives are achieved.
    •  
  • Research Paper Writing (optional)
    •  

Related Work

[1]  Linus Nwankwo and Elmar Rueckert. 2024. The Conversation is the Command: Interacting with Real-World Autonomous Robots Through Natural Language. In Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI ’24). Association for Computing Machinery, New York, NY, USA, 808–812. https://doi.org/10.1145/3610978.3640723.

[2]  Nwankwo, L., & Rueckert, E. (2024). Multimodal Human-Autonomous Agents Interaction Using Pre-Trained Language and Visual Foundation ModelsarXiv preprint arXiv:2403.12273.




After Business Trip Paperwork

New Obligation: Submit along comparitive offers

(This starts from 30 July 2024)

You are required to submit a comparative analysis of the prices for your trip along with other documents for claims.

Documents to submit and print in hard copy:

  • Conference/summer school schedule
  • Transport ticket (flight/intercity train/city train/bus)
  • Registration fee
  • Spesenabrechnung/Reisekostennachweis (from SAP)
  • Accomodation
  • Comparative analysis of the prices (only be paid 50% of the flight costs, if this document is not submitted along)

New Obligation: Monthly data entry for öbb tickets

(This starts from 26 Jun 2024)

You can find the entry form at here: https://cloud.cps.unileoben.ac.at/index.php/s/GTFTrT8btK7mMtW

Procedure to submit paperwork to Financial Department

Published on 21 May 2024

Update 1 on 26 Jun 2024

Update 2 on 30 July 2024

1. Login into SAP

At your SAP, click on “Meine Reisen und Spesen”.

2. Click on your desired Trip

In my case, I will show example in Austria.

 

Click “Welter” to proceed.

3. At the Main Page with 4 steps

Step 1: Verify every information especially Kontierung (Your project number)

Next, click on checkbox with * and then proceed with “Belege erfassen”

Step 2: Add all related claims

Step 3: Validate

There are two options: Save it for future or Sent it to financial department


4. Final step

  • Prepare all the original receipts and keep a copy with you.
  • Print out the above from system
  • Put the documents at “Dienstreisen Folder” at Regina’s place
  • Bring the folder to Uni Post Office at 1st floor of old building.




Print a Poster

Kindly ask for permission before proceed to poster printing.

 

To print a poster, you can either go for:

Option 1: Mail Boxes Leoben

Price list:

A0: ~20.00 euro

CPS account: KST 101900

Email them, and they will record at CPS account.


Option 2: ÖH Leoben

Fill the form at : https://www.oeh-leoben.at/de/plotauftrag

Price list:

A0: ~6.63 euro

A1: ~3.35 euro

Only cash payment, and pay it when obtaining.




Zeitungsinterview Kosmo

Unser Lehrling Kosmo Obermayer berichtet über seinen Lehrberuf als Informationstechnologe mit Betriebstechnik und gibt spannnende Einblicke in seinen Alltag am Lehrstuhl für Cyber-Physical-Systems.

Der Artikel ist unter diesem Link erreichbar.

Ein fröhlicher Kosmo




B.Sc. Thesis: Reineke Peter on Deep Learning for Predicting Fluid Dynamics

Supervisor: Univ.-Prof. Dr Elmar Rückert

Project: K1-MET P3.4
Start date: 1st of May 2024

Theoretical difficulty: high
Practical difficulty: mid

Topic

The the steel production, the steel quality heavily depends on the dynamic processes of the meniscus level fluctuations in the mold. These complex dynamic  processes can be observed using IR cameras observing the surface level and the casting powder temperature. 

The goal of this thesis is to develop and compare deep learning approaches (CNNs, transformers) for predicting fluid dynamics in lab prototype environment. 

Tasks

  • Literature research of state of the art, see references
  • Lab prototype environment for generating complex (structured and chaotic) fluid dynamics
  • Dataset recording, visualization and annotation
  • Deep Learning algorithm implementation (CNNs & Transformers)
  • Evaluation on different datasets (predictable dynamics, complex dynamics, synchronous and async. surface level dynamics, chaotic dynamics).
  • Thesis writing.

References




B.Sc. Thesis: Sukal Tanja on Creating a Python development environment for LEGO Ev3 robot systems

Supervisor: Univ.-Prof. Dr Elmar Rückert

Start date: 1st of August 2023

Theoretical difficulty: mid
Practical difficulty: mid

Thema der Arbeit

LEGO Ev3 Robotersysteme werden am Lehrstuhl in der Lehre eingesetzt, um einen einfachen Einstieg in die Robotik zu ermöglichen. Zahlreiche Algorithmen können erprobt werden: 

  • Pfadplanung und Navigation
  • Kalman Filter
  • Kartierung / SLAM
  • Objektmanipulation 
  • Kamerabasierte Objekterkennung
  • Regelungsalgorithmen
  • Telemetrieaufgaben
  • usw.

Unsere EV3 Systeme sind mit einem Linux Betriebsystem (https://www.ev3dev.org) ausgestattet und können in der Programmiersprache Micro-Python bespielt werden. 

Ziel dieser Arbeit ist es eine Entwicklungsumgebung für die klassische Python Programmiersprache zu schaffen. Dabei sollen Beispielprojekte umgesetzt und Limitierungen dokumentiert werden. 

Aufgaben

  • Recherche und Dokumentation zur State-of-the-Art
  • Beispielprojekte Implementieren
  • Git Repository mit Dokumentation erstellen.
  • Dokumentation der Arbeitsschritte & Verfassen der Diplomarbeit

Wissenschaftlicher Beitrag

  • Entwicklung und Implementierung einer Entwicklungsumgebung in Python für die Lehre. 
  • Veröffentlichung des Source Codes. 

Abschlussarbeit

B.Sc. Thesis by Tanja Sukal on Open-Source LEGO EV3 Python Framework for Teaching, 2024.




M.Sc. Thesis: Einberger Stefan on Retrofitting of a Cyber-Physical System to a reactive molding machine for thermoset resins

Supervisor: Univ.-Prof. Dr Elmar Rückert

Company: Ottronic GmbH
Start date: 1st of October 2023

Theoretical difficulty: mid
Practical difficulty: mid

Thema der Arbeit

Bei Ottronic bildet die Verkapselung unserer Elektroniken und Motoren mittels eines eigens adaptierten Reactive Injection Molding (RIM) Verfahren die Grundlage für die Produktion von medienresistenten Elektroniken und elektrischen Hochleistungsantrieben. Im Zuge des RIM werden sogenannte b-staged Duroplaste unter einem präzisen Druck- und Temperaturprofil verarbeitet, geformt und final ausgehärtet. Um die angestrebte höchste Produktqualität, mit Blick auf Medizintechnik-Applikationen, zu gewährleisten, muss dieser Prozess mit jedem Schuss optimal eingestellt sein. Daher soll ein Cyber Physical System (CPS) entwickelt werden, welches auf unsere RIM-Anlagen nachgerüstet wird.

Ziel der Diplomarbeit ist, dass das finale CPS selbstständig Prozessschwankungen (Chargenschwankungen, Hallen-/Maschinentemperatur, Feuchte, etc.) erkennen kann und Regelparameter des Spritzgussvorgangs (Schmelzzeit, Aushärtezeit, Einpresskraft, etc.) anpassen um eine gleichbleibende Produktqualität ohne menschliche Kontrolle zu garantieren.

Dabei soll in einem ersten Schritt der aktuelle Prozess beschrieben werden. Daraus sollen die notwendigen Produktionsparameter abgeleitet und deren Auswirkungen auf den Prozess analysiert sowie die Kernpunkte zur Prozessoptimierung definiert werden, sowie ein Modell dafür entwickelt werden. Das zu entwickelnde Modell dient dann in weitere Folge als Basis für das CPS um den Prozess zu bewerten zu können, Abweichungen zu detektieren und Regelparameter abzuändern.

In weiterer Folge soll das Modell auf der Maschinensteuerung CPS integriert und implementiert werden. Abschließend muss noch die Verknüpfung der neu gewonnen Intelligenz des CPS mit der aktuellen Regelung der Maschine verknüpft werden um eine nahezu autonome Prozessführung zu garantieren, sowie eine neue ressourcen- als auch energieeffizientere Spritzpressmethode garantieren.

Aufgaben

  • Recherche und Dokumentation zur State-of-the-Art
  • Erfassung aller notwendigen Produktionsparameter sowie Analyse derer auf Produktqualität
  • Entwicklung eines Modelles zur virtuellen Beschreibung des Spitzgussprozesses Implementierung des Modells auf Maschinensteuerung als Basis für ein CPS
  • Identifizierungen von Abweichungen im Prozess sowie Implementierung von Gegenmaßnahmen
  • Verknüpfung der Prozessregelung mit Steuerungsempfehlungen des CPS.
  • Dokumentation der Arbeitsschritte & Verfassen der Diplomarbeit

Wissenschaftlicher Beitrag

  • Entwicklung und Implementierung eines CPS, welches einen RIM Prozesses erfassen und abbilden kann.
  • Aufbau von Methoden um auf Basis des CPS Änderungen des RIM-Prozesses erkennen und bewerten zu können
  • Retrofitting der gewonnen Intelligenz in eine bestehende Maschinensteuerung



Montanuniversität Leoben logos

Here’s a link to download logos in full resolutions:

https://qm.unileoben.ac.at/en/qm-documents/q4-communication

 




Booking a Trip for a Conference/Visit/Summer School

Put your documents at TravelApplications/drafts

Here: https://cloud.cps.unileoben.ac.at/index.php/f/977844

Create the folder with your name

 

Travel Planning Checklist

Approval and Registration

  • Initial Planning: Check for a reasonable flight itinerary. Check if 1-2 days before and after the event have a substantially lower price. 
  • Obtain Approval: Secure trip approval from Elmar. Argue according to the initial planning.
  • Travel System Entry: Request Regina to input the trip details into the travel system. Specify which days are for official duties (e.g., conference, lab visits) and which are for personal stay. Provide Regina with the proof of acceptance, or reason to travel.

Booking Essentials

  • Accommodation and Commute Options: Provide a comparison spreadsheet of different options within the budget. Opt for reasonable over the cheapest options.
  • Booking Approval: Get approved by Elmar.
  • Accommodations and Commute: After obtaining approval, book your stay, conference registration, accommodations, etc. 

Travel Insurance

  • Carry Insurance Documentation: If traveling abroad, particularly outside the EU, bring a printed copy of the university’s or other relevant insurance policy

Visa Requirements

  • Include Embassy Commute: If a visa is necessary, incorporate the embassy commute in the travel system and communicate this to the secretary for travel cost reimbursement.
  • Visa Application Time: Visa application efforts are recognized as working hours.

After the Travel

  • Receipts: After the end of the trip, provide Regina with all the receipts, invoices, and tickets from:
      • Airplanes, trains, buses, and boats: tickets, invoices, bank statement
      • Accommodation: invoice, bank statement
  • Registrations: invoice, bank statement
  • etc.

 

Important Notes

  • OEBB Trains: The chair has a membership with OBB, please book the ticket in the user’s name. You can obtain the user’s login information from Regina.
  • After the travel: Keep all original receipts and submit them to Regina after returning.
  • Report Everything: Due to Austrian law for work insurance coverage, you must inform Regina by email if you will be outside the university zone during working hours, even for a few hours.
  • Private Stay: A private stay cannot exceed 50% of the duration of the working days. For example, if a conference is for six days, your private stay must be a maximum of three days. Otherwise, the university will cover only 50% of the flight tickets and hotel.

Tips:

  • Credit card with travel coverage (check if hospitalization is included for overseas)



Organizing Wiki Page Categories

Here’s a guide on how to label your categories effectively:

  1. wiki_phds: This category should encompass all aspects of your day-to-day life as a PhD student.

  2. wiki_road_to_thesis: Include guidelines, tips, and resources related to various stages of thesis writing, from proposal development to final defense preparations.

  3. wiki_hard_software: Use this category to share information, tutorials, and updates about the hardware and software used in your research projects.

  4. wiki_scientific_research_aspects: Discuss methodologies, data analysis techniques, experimental setups, and anything else related to the scientific rigor of your work.

  5. wiki_teaching_aspects: This category is dedicated to sharing insights, strategies, and resources for effective teaching, whether it’s leading a seminar, designing a course, or mentoring undergraduates.

  6. wiki_career_aspects: This category covers everything related to career development and professional growth.

The category label determines where the post will appear in its respective section.