Embedded Agents

AI Safety Fundamentals: Alignment

CHAPTER

The Four Complications of Embedded Agency: Decision Theory and Embedded World Models

An exploration of the four complications of embedded agency (decision theory, embedded world models, robust delegation, and subsystem alignment) and how they are interconnected. The chapter discusses the challenges of embedded optimization and of building an accurate world model inside an agent that is smaller than the world it models, along with the problems of non-Bayesian updates and observer-based world models.
