"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Popular Mechanistic Interpretability: Goodfire Lights the Way to AI Safety

Aug 17, 2024
Dan Balsam, CTO of Goodfire with extensive startup engineering experience, and Tom McGrath, Chief Scientist focused on AI safety from DeepMind, dive into mechanistic interpretability. They explore the complexities of AI training, discussing advances like sparse autoencoders and the balance between model complexity and interpretability. The conversation also reveals how hierarchical structures in AI relate to human cognition, illustrating the need for collaborative efforts in navigating the evolving landscape of AI research and safety.
01:55:33

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • The podcast highlights the crucial advancements in mechanistic interpretability that empower researchers to diagnose and enhance AI model performance.
  • GoodFire's mission focuses on understanding AI models' inner workings to ensure safer deployment and accountability in AI technologies.

Deep dives

Introducing GoodFire

GoodFire is a company co-founded by Dan Balsam and Tom McGrath, focusing on mechanistic interpretability of AI models. Dan serves as the CTO with experience as a startup engineer while Tom, the chief scientist, has a background in AI safety research at DeepMind. Their mission is to understand AI models' internal workings to engineer solutions for AI control and safety. This field has seen substantial advancements in recent years, led by organizations like Anthropic, DeepMind, and OpenAI, enabling progress in tackling the AI black box problem.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode