AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

Is Inverse Reinforcement Learning a Good Solution to Assistance Games?

In inverse reinforcement learning, people learn the reward function from a person's perspective. In co operative viorel, there are actually two agents in the environment: There's the person and the robot. People will adapt their behavior when some one is wat ing them to be more informative or try to better accomplish their goals. The reason why it's crucial is that as we are building representations of our goals for future and more and increasingly advanced systems to optimize. We are the humans in this assistanc right? It's not actually co operative iorel it's got multiple people, non stationary object, non stationary rewards, partial information up the wazoo, all kinds of things. But ultimately

Play episode from 19:47
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app