
#66 – Michael Cohen on Input Tampering in Advanced RL Agents
Hear This Idea
00:00
The Different Models of Inverse Reinforcement Learning
In inverse reinforcement learning, the goal information is not a reward. It's observations of human behavior and tries to reason about what that goal was. So it's different from imitation learning by going beyond just trying to imitate actual human behavior.
Play episode from 02:10:18
Transcript


