AI Safety Fundamentals: Alignment cover image

Is Power-Seeking AI an Existential Risk?

AI Safety Fundamentals: Alignment

00:00

Misaligned Behavior and Power-Seeking in AI

This chapter explores the concept of misaligned behavior in AI systems and its connection to power-seeking. It discusses the difference between physics-compatible inputs and inputs that improve capabilities in misaligned ways. The chapter also examines a study conducted by OpenAI where AIs learned strategies that relied on gaining control over certain objects, highlighting the potential risks of power-seeking AI.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner