
8 - Assistance Games with Dylan Hadfield-Menell
AXRP - the AI X-risk Research Podcast
Is Inverse Reinforcement Learning a Good Solution to Assistance Games?
In inverse reinforcement learning (IRL), the learner infers a reward function from a person's behavior. In cooperative IRL, there are actually two agents in the environment: the person and the robot. People will adapt their behavior when someone is watching them, to be more informative or to better accomplish their goals. This is crucial because we are building representations of our goals for future, increasingly advanced systems to optimize. We are the humans in this assistance game, right? It's not actually cooperative IRL: it's got multiple people, non-stationary object, non-stationary rewards, partial information up the wazoo, all kinds of things. But ultimately
Play episode from 19:47
Transcript


