Inverse Reinforcement Learning - The Opposite of Planning

In inverse reinforcement learning, what you are doing is solving. Well, in this case, the opposite of planning, really. So if planning is the problem of taking in a mark off decision process and producing a sequence of actions that get high reward, in inverse reinforcing learning, you try to infer the reward function that those actions optimize for. The relationship this has to co operative iorel is that in cooperative iorel, you are dealing with similar problems.

Play episode from 18:03

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app