Meeting Notes 23.03.2022 – Adna (TRAIN)
Meeting Details
Date : 23 March 2022
Time : 10:00 – 11:00
Location : Webex
Participants: Fotios Lygerakis, Adna Blieck
Agenda
- Get to know Adna’s work
  - use of RL algorithms
  - objectives
  - problem formalization
Notes
- current objective: test different RL algorithms before starting work with the robot
- Current toy environment: Atari Seaquest
- RL algorithms tested
  - DDQN
  - IQN
  - FQF
- Learning from human demonstrations
  - store them in a replay buffer and give them high priority over the rest
- ultimate goal: “how a human rates an algorithm”
  - sense of agency
  - trust
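
Sketch for the human-demonstrations note (not discussed in the meeting, added for illustration): one possible way to keep demonstrations in a replay buffer and sample them with higher priority than the agent's own transitions. The class name, the fixed priority weights, and the choice to never evict demonstrations are all assumptions; the actual setup may use a different scheme, such as TD-error-based prioritization.

```python
import random
from collections import deque


class DemoPriorityReplayBuffer:
    """Hypothetical replay buffer that favours human demonstrations.

    Assumptions (not from the meeting): demonstrations are never evicted,
    and priority is a fixed per-transition sampling weight.
    """

    def __init__(self, capacity, demo_priority=5.0, agent_priority=1.0):
        self.demo_priority = demo_priority
        self.agent_priority = agent_priority
        self.demos = []                       # human demonstrations, kept forever
        self.agent = deque(maxlen=capacity)   # agent transitions, oldest dropped first

    def add_demo(self, transition):
        """Store a transition recorded from a human demonstration."""
        self.demos.append(transition)

    def add(self, transition):
        """Store a transition collected by the agent itself."""
        self.agent.append(transition)

    def sample(self, batch_size, rng=random):
        """Sample with replacement, weighting demonstrations more heavily."""
        items = self.demos + list(self.agent)
        if not items:
            raise ValueError("cannot sample from an empty buffer")
        weights = ([self.demo_priority] * len(self.demos)
                   + [self.agent_priority] * len(self.agent))
        return rng.choices(items, weights=weights, k=batch_size)
```

Usage: call add_demo() once per demonstration transition before training, add() during training, and sample(batch_size) for each update. With the default weights, each demonstration transition is five times as likely to be drawn as each agent transition.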