#66 – Michael Cohen on Input Tampering in Advanced RL Agents

Hear This Idea

The Different Models of Inverse Reinforcement Learning

In inverse reinforcement learning, the goal information is not a reward. Instead, the agent receives observations of human behavior and tries to reason about what goal that behavior was aiming at. This sets it apart from imitation learning: it goes beyond simply imitating the observed human behavior.
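
To make the contrast concrete, here is a minimal sketch of goal inference from observed behavior. It is not taken from the episode. The one-dimensional world, the noisily rational demonstrator model, the `beta` parameter, and every function name are assumptions chosen for illustration.

```python
import math

ACTIONS = (-1, 0, +1)  # step left, stay, or step right on a line of cells


def action_likelihood(state, action, goal, beta=2.0):
    """P(action | state, goal) for a hypothetical noisily rational human."""
    # Score each action by how close it leaves the human to the goal.
    scores = {a: -abs(state + a - goal) for a in ACTIONS}
    normalizer = sum(math.exp(beta * s) for s in scores.values())
    return math.exp(beta * scores[action]) / normalizer


def infer_goal(demonstrations, candidate_goals):
    """Posterior over candidate goals given (state, action) pairs, uniform prior."""
    weights = {}
    for goal in candidate_goals:
        likelihood = 1.0
        for state, action in demonstrations:
            likelihood *= action_likelihood(state, action, goal)
        weights[goal] = likelihood
    total = sum(weights.values())
    return {goal: w / total for goal, w in weights.items()}


def act_toward(state, goal):
    """Greedy action for an agent that adopts the inferred goal."""
    return max(ACTIONS, key=lambda a: -abs(state + a - goal))


# Observed behavior: the human walks right from cell 0 to cell 3, then stays.
demos = [(0, +1), (1, +1), (2, +1), (3, 0)]

# Imitation (assumed here to be a plain lookup table) only knows seen states.
cloned_policy = dict(demos)

# Goal inference reasons about what the behavior was trying to achieve.
posterior = infer_goal(demos, candidate_goals=range(7))
best_goal = max(posterior, key=posterior.get)

print("inferred goal:", best_goal)
print("imitation at unseen cell 6:", cloned_policy.get(6))
print("goal-directed action at cell 6:", act_toward(6, best_goal))
```

At cell 6, which never appears in the demonstrations, the lookup-table imitator has nothing to copy, while the agent that inferred the goal can still choose an action that moves toward it. Real inverse reinforcement learning methods work with much richer environments and goal representations than this toy setup.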

Play episode from 02:10:18