CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

The AI's Utility Function - Tile the Universe With Paperclips in Humans' Favorite Color

...AI refuses to be shut off, continues to gather information to fill the holes in its knowledge of the human utility function, and then optimizes for its true AI utility function. This just proves that it doesn't fail gracefully in the sense of letting itself be turned off. And although I chose a deliberately outrageous example, the same considerations apply if it's 1% different from the true human utility function, or 0.1% different.

7. Stuart/CHAI had at least three substantial objections to this post.
