“Mech interp is not pre-paradigmatic” by Lee Sharkey

LessWrong (Curated & Popular)

Evolution of Mechanistic Interpretability and Its Links to Neuroscience

This chapter explores the evolution of mechanistic interpretability (mech interp), particularly in relation to neuroscience, computational paradigms, and artificial neural networks. It describes two distinct waves of thought, detailing the initial skepticism of the first wave and the crises that led to a second wave focused on polysemanticity and superposition. The discussion closes by considering a potential third wave and argues for new approaches to the field's remaining conceptual challenges.
