CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

The Off Switch Problem Is Making Convergence Hard

I don't feel comfortable trying to adjudicate a debate between two people who have so much of an expertise advantage over me. The most I'll say is that their crux seems to be whether the AI could end up with an uncorrectably wrong model of the human utility function. If no, everything Stuart says makes sense. If yes, everything Eliezer writes does. This doesn't seem to have much to do with the off switch problem per se, except to repeat the basic idea that an AI with a fixed, known "U" won't want to be switched off.
