Generative Now | AI Builders on Creating the Future cover image

Inside the Black Box: The Urgency of AI Interpretability

Generative Now | AI Builders on Creating the Future

00:00

Anthropic's Interpretability Goals

Nnamdi asks why Anthropic invests in interpretability; Jack explains root-cause debugging, preventing reward hacks, and shaping training.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app