8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

Inverse Reinforcement Learning - The Opposite of Planning

In inverse reinforcement learning, what you are doing is solving, well, in this case, the opposite of planning, really. So if planning is the problem of taking in a Markov decision process and producing a sequence of actions that get high reward, in inverse reinforcement learning, you try to infer the reward function that those actions optimize for. The relationship this has to cooperative IRL is that in cooperative IRL, you are dealing with similar problems.
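To make the planning versus inverse reinforcement learning contrast concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than something from the episode: a five-state chain world, a small family of candidate "goal" rewards, and the helper names `step`, `q_values`, `plan`, `goal_reward`, and `infer_goals`. Planning maps a reward to actions. The inverse step here is a brute-force consistency check, not a full IRL algorithm: it keeps every candidate reward under which the demonstrated actions are optimal.

```python
GAMMA = 0.9            # discount factor (assumed)
N_STATES = 5           # states 0..4 on a line (toy world, assumed)
ACTIONS = (-1, 0, 1)   # move left, stay, move right


def step(state, action):
    """Deterministic transition, clipped at both ends of the chain."""
    return min(max(state + action, 0), N_STATES - 1)


def q_values(reward, iters=500):
    """Value iteration; the reward is collected on arriving in a state."""
    v = [0.0] * N_STATES
    for _ in range(iters):
        v = [
            max(reward[step(s, a)] + GAMMA * v[step(s, a)] for a in ACTIONS)
            for s in range(N_STATES)
        ]
    return {
        (s, a): reward[step(s, a)] + GAMMA * v[step(s, a)]
        for s in range(N_STATES)
        for a in ACTIONS
    }


def plan(reward, start, horizon):
    """Planning: MDP plus reward in, sequence of high-reward actions out."""
    q = q_values(reward)
    state, actions = start, []
    for _ in range(horizon):
        best = max(ACTIONS, key=lambda a: q[(state, a)])
        actions.append(best)
        state = step(state, best)
    return actions


def goal_reward(goal):
    """Candidate reward: 1 for arriving at `goal`, 0 elsewhere."""
    return [1.0 if s == goal else 0.0 for s in range(N_STATES)]


def infer_goals(demo, tol=1e-6):
    """Inverse direction: (state, action) pairs in, consistent goals out."""
    consistent = []
    for goal in range(N_STATES):
        q = q_values(goal_reward(goal))
        if all(
            q[(s, a)] >= max(q[(s, b)] for b in ACTIONS) - tol
            for s, a in demo
        ):
            consistent.append(goal)
    return consistent


actions = plan(goal_reward(3), start=0, horizon=4)
full_demo = [(0, 1), (1, 1), (2, 1), (3, 0)]   # walk right, then stay at 3
short_demo = [(0, 1), (1, 1)]                  # only the first two moves
print(actions, infer_goals(full_demo), infer_goals(short_demo))
```

With the full demonstration, only one candidate goal survives the check. With the shorter one, several do, which is the usual caveat with this kind of inference: observed behavior often fails to pin down a single reward function.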

Play episode from 18:03