Understanding Parameter Decomposition in Neural Networks

This chapter explores parameter decomposition in neural networks, focusing on how a network's weights can be broken into distinct components, each linked to a specific mechanism. It challenges existing assumptions in mechanistic interpretability and argues that computations, rather than representations, should be treated as fundamental.

Play episode from 19:43
