AI Safety Fundamentals

Deceptively Aligned Mesa-Optimizers: It’s Not Funny if I Have to Explain It

Jan 4, 2025
Ask episode
Chapters
Transcript
Episode notes