AI Safety Fundamentals: Alignment cover image

Embedded Agents

AI Safety Fundamentals: Alignment

00:00

The Four Complications of Embedded Agency: Decision Theory and Embedded World Models

An exploration of the four complications of embedded agency - decision theory, embedded world models, robust delegation, and subsystem alignment - and how they are interconnected. The challenges of embedded optimization and creating accurate world models within a smaller agent are discussed, along with the problems of non-Bayesian updates and observer-based world models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app