AI Safety Fundamentals: Alignment

Embedded Agents

6 snips
May 13, 2023
Computer scientist Scott Garrabrant discusses the challenge of building learning agents for real-world goals. The podcast explores the concept of embedded agents, the four complications of embedded agency, and open problems in world models and subsystem alignment. It also delves into the conflicts that arise when spinning up sub-agents with different goals.
Ask episode
Chapters
Transcript
Episode notes