AXRP - the AI X-risk Research Podcast cover image

7 - Side Effects with Victoria Krakovna

AXRP - the AI X-risk Research Podcast

00:00

The Nuclear Power Plant Example and the Next Paper

In this example, agent's task is to go to the store and buy something. And in order to do that, it needs to leave the house. So once you've opened the door, your step wise base line will kind of reset to the door being open. The effect of having the door open kind of don't really matter for yodo getting food for for the user, or whatever. This is like a clear example of a desirable offsetting,. where if you have these, like, instrumental effects that are not actually part of the goalwil want them to be undone.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app