4min chapter

Machine Learning Street Talk (MLST) cover image

Neel Nanda - Mechanistic Interpretability

Machine Learning Street Talk (MLST)

CHAPTER

The Inductive Bias of Deep Learning

The idea here is rather than giving it a convolution you give it this attention mechanism where each token gets a query saying what it's looking for. The model looks from each destination token to the source tokens earlier on with the keys that are most relevant to the current query and models. Each head acts independently of the others and add to their outputs together and just directly add to the residual string. This can result in interesting bugs like there's this motif of a skip trigram the model realizes that if the current thing is three and two has appeared in the past then four is more likely to come next.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode