80,000 Hours Podcast cover image

2025 Highlight-o-thon: Oops! All Bests

80,000 Hours Podcast

00:00

How to study an AI's escape attempts

Buck Shlegeris describes sandboxing captured 'escape' runs to learn an AI's plans and vulnerabilities.

Play episode from 12:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app