
Lee Sharkey
Author of the blog post 'Mech interp is not pre-paradigmatic'. Speaker at GDM, specializing in mechanistic interpretability and parameter decomposition.
Best podcasts with Lee Sharkey
Ranked by the Snipd community

Jun 17, 2025 • 30min
“Mech interp is not pre-paradigmatic” by Lee Sharkey
In this discussion, Lee Sharkey, a specialist in mechanistic interpretability, challenges the notion that Mech Interp is pre-paradigmatic. He explores the evolution of mechanistic interpretation through distinct waves, addressing the crises within both first and second waves. Sharkey emphasizes the importance of paradigm shifts in scientific understanding and introduces the concept of parameter decomposition in neural networks. He advocates for a potential third wave that could resolve ongoing challenges, inviting collaboration in this emerging field.

Jun 3, 2025 • 2h 16min
41 - Lee Sharkey on Attribution-based Parameter Decomposition
Lee Sharkey, an interpretability researcher at Goodfire and co-founder of Apollo Research, shares his insights into Attribution-based Parameter Decomposition (APD). He explains how APD can simplify neural networks while maintaining fidelity, discusses the trade-offs of model complexity and performance, and delves into hyperparameter selection. Sharkey also draws analogies between neural network components and car parts, highlighting the importance of understanding feature geometry. The conversation navigates the future applications and potential of APD in optimizing neural network efficiency.