
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
Cooperative Inverse Reinforcement Learning - Ceril
Ceril is a sub class of assistance games. We built it an extension of a markov decision process, which is a mathematical model for planning used allot in ai systems. What's new is that you also have a robot player. So this is an addition into the m d p. You can think about it as a turn taking scenario where the person goes first, then the robot goes second.
Play episode from 16:02
Transcript


