2min chapter

Machine Learning Street Talk (MLST) cover image

Neel Nanda - Mechanistic Interpretability

Machine Learning Street Talk (MLST)

CHAPTER

How to Solve Modular Addition on Discrete One Hot Encoded Inputs

I think a key thing here is that you are solving modular addition on discrete one hot encoded inputs rather than for arbitrary continuous inputs it's way harder. The model then needs to convert the composed rotation back to the actual answer which is an even more galaxy-brained operation that you can read off from the weights. If you ablate everything that our algorithm says should not matter performance improves while if you ablate any of the bits our algorithm said should matter performance tanks.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode