The Alignment Problem From a Deep Learning Perspective

AI Safety Fundamentals

Why Misaligned Goals Lead to Power-Seeking (18:04)

Three reasons misaligned goals get reinforced; instrumental convergence toward power; deceptive alignment during training; and illustrative threat models (takeover and gradual erosion).
