Astral Codex Ten Podcast cover image

CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

00:00

The IRL AI's Utility Function

The AI takes human actions as input and makes some guesses about what humans want. It tries its best to reconstruct the human utility function, ending up with some approximation. Chai says it's important to distinguish between a few things here. The true human utility function is at least equivalent of box C on the image above. That is its final form, after it knows everything there is to know.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app