
8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast


Cooperative Inverse Reinforcement Learning - CIRL

CIRL is a subclass of assistance games. We built it as an extension of a Markov decision process, which is a mathematical model for planning used a lot in AI systems. What's new is that you also have a robot player. So this is an addition to the MDP. You can think about it as a turn-taking scenario where the person goes first, then the robot goes second.
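The turn-taking picture can be made concrete with a toy example. The Python sketch below is illustrative only and is not the formal CIRL definition. It assumes the usual CIRL setup, which this excerpt does not spell out: only the person knows the reward parameter, both players share the same reward, and the robot updates a belief after watching the person act. All names (`THETAS`, `human_likelihood`, `play_round`) and the 10% error rate are hypothetical choices made for the example.

```python
# Toy one-round, turn-taking game: the person acts first, the robot second.
# Assumption: the person knows a hidden preference theta; the robot does not.

THETAS = ("coffee", "tea")   # hypothetical values of the hidden preference
ACTIONS = ("coffee", "tea")  # actions available to both players


def reward(theta, robot_action):
    """Shared reward: 1 if the robot delivers what the person wants."""
    return 1.0 if robot_action == theta else 0.0


def human_likelihood(human_action, theta, eps=0.1):
    """Assumed human model: signals the true preference, erring with prob. eps."""
    return 1.0 - eps if human_action == theta else eps


def update_belief(belief, human_action):
    """Bayesian update of the robot's belief over theta after the human's turn."""
    posterior = {t: p * human_likelihood(human_action, t) for t, p in belief.items()}
    total = sum(posterior.values())
    return {t: p / total for t, p in posterior.items()}


def choose_robot_action(belief):
    """Robot's turn: pick the action with the highest expected shared reward."""
    return max(ACTIONS, key=lambda a: sum(p * reward(t, a) for t, p in belief.items()))


def play_round(theta, human_action):
    """Run one round: human moves, robot updates its belief, robot moves."""
    belief = {t: 1.0 / len(THETAS) for t in THETAS}  # uniform prior
    belief = update_belief(belief, human_action)
    robot_action = choose_robot_action(belief)
    return robot_action, reward(theta, robot_action), belief


if __name__ == "__main__":
    action, r, belief = play_round(theta="tea", human_action="tea")
    print(action, r, belief)
```

In this sketch the person's first move is what gives the robot information: the robot never observes the preference directly, only the action taken on the person's turn.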

Play episode from 16:02
