AXRP - the AI X-risk Research Podcast cover image

8 - Assistance Games with Dylan Hadfield-Menell

AXRP - the AI X-risk Research Podcast

00:00

Cooperative Iorel and the Off Switch Game - Is Uncertainty Important?

Inverse reward design looks at a co operative iorel interaction. The person goes first and selects a proxy reward function of some kind, given an observation of a training environment. And now the robot's goal is to maximize utility in this new deployment setting. In inverse reward design, we're arguing that uncertainty about objectives is important.

Play episode from 01:31:20
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app