Astral Codex Ten Podcast cover image

CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

00:00

The AI Will Always Let Itself Be Turned Off

If the AI tries to maximize one or the other utility function, it's only got a 50% chance of getting it right. In real life, there will be much greater uncertainty about what people want. Miri notes that the AI has a sixth option: It can let itself be turned off. This would allow humans to pursue the true human utility function which is correlated with the unknown true AI utility function.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app