Astral Codex Ten Podcast cover image

CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

00:00

Let Your AI Be Shut Off

If you build a meta rule that, when updated, or for human, learns a utility function that is not what you want, its meta-ness doesn't help at all on courageability. It's still the better course of action to extrapolate all the tangled info from the humans and then go do that thing sovereignly. Third, in the end, the AI really would let itself be shut off, for the same reasons listed in the original paper. Notice that diagram B in figure one represents absolutely certain and absolutely incorrect knowledge about some of the black lines, which should never happen.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app