AI Safety Fundamentals: Alignment cover image

Embedded Agents

AI Safety Fundamentals: Alignment

00:00

Open problems in embedded world models, robust delegation, and subsystem alignment

This chapter explores major open problems in embedded world models, including logical uncertainty, multilevel modeling, ontological crises, robust delegation, trust issues, Vingian reflection, value learning, courageability, and subsystem alignment.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app