Astral Codex Ten Podcast cover image

CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

00:00

Using a Meta Utility Function to Create an Aligned Sovereign

Eliezer: Any rule that we or anybody knows how to state updated off observations until the point when the AI doesn't think it's worth continuing to hunt down further observables and tangled with the utility function. Miri: If you missed a line of code from the textbook, the resulting creation would not let you shut it down or edit that line of code back in. Scott: Even if this argument carried and saved us all, it would not have solved courageability. It would have just solved the problem of an aligned sovereign instead.

Play episode from 38:57
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app