CHAI, Assistance Games, And Fully-Updated Deference

Astral Codex Ten Podcast

Introduction

This Machine Alignment Monday post will focus on this imposing-looking article. It has some text that reads: "The problem of Fully Updated Deference is an obstacle to using moral uncertainty to create corrigibility." Since the AI's meta-utility function defines its ideal target, we could tell the AI, "You should let us shut you down because we know something about your ideal target that you don't."
