
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
00:00
Inverse Reinforcement Learning - The Opposite of Planning
In inverse reinforcement learning, what you are doing is solving. Well, in this case, the opposite of planning, really. So if planning is the problem of taking in a mark off decision process and producing a sequence of actions that get high reward, in inverse reinforcing learning, you try to infer the reward function that those actions optimize for. The relationship this has to co operative iorel is that in cooperative iorel, you are dealing with similar problems.
Play episode from 18:03
Transcript


