“Mech interp is not pre-paradigmatic” by Lee Sharkey

LessWrong (Curated & Popular)

Understanding Parameter Decomposition in Neural Networks

This chapter explores parameter decomposition in neural networks, focusing on how a network's weights can be decomposed into distinct components, each linked to a specific mechanism. It challenges existing assumptions in mechanistic interpretability and argues for treating computations, rather than representations, as the fundamental objects of study.
