Astral Codex Ten Podcast cover image

CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

00:00

I'll Tile the Universe With Paperclips in Humans' Favorite Color

AI can potentially believe the humans are worse optimizers, or that V Sub I diverge from its U Sub I. Pi Sub 5 will still be preferred so long as actions that do well under U Sub 2 tend to do poorly under U Sub 3 and vice versa. The problem of fully updated deference is a response by Miri, E.G. Elie to CHI or CH AI,. Stuart Russell's AI alignment organization at University of California Berkeley. It tries to convince them that their preferred AI safety agenda won't work.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app