"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

E48: Mechanizing Mechanistic Interpretability with Arthur Conmy

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Neutralizing Neural Components

This chapter explores the methodology for selectively modifying neural network connections to assess their impact on output performance. It emphasizes maintaining clean activations while evaluating the influence of specific components, such as attention heads, on the overall model integrity.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app