80,000 Hours Podcast cover image

#226 – Holden Karnofsky on unexploited opportunities to make AI safer — and all his AGI takes

80,000 Hours Podcast

00:00

MAPS vs. Misalignment: Distinguishing the Risks

Holden distinguishes routine misalignment from misaligned power‑seeking (MAPS), examining empirical signs and training implications for frontier models.

Play episode from 02:49:30
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app