
Embedded Agents
AI Safety Fundamentals: Alignment
The Four Complications of Embedded Agency: Decision Theory and Embedded World Models
An exploration of the four complications of embedded agency - decision theory, embedded world models, robust delegation, and subsystem alignment - and how they are interconnected. The challenges of embedded optimization and creating accurate world models within a smaller agent are discussed, along with the problems of non-Bayesian updates and observer-based world models.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.