19 - Mechanistic Interpretability with Neel Nanda

AXRP - the AI X-risk Research Podcast

What's the Difference Between Sensory Reasoning and Processing?

"Sensory, reasoning, and motor" is a good description of what the neurons are doing, but the model also spends a significant amount of its computation on attention. The model first routes the separate tokens of the phrase "the Eiffel Tower" to the final token and then looks up the fact that it's in Paris. Information can be routed around like this several times; for example, there was great work by David Bau and Kevin Meng looking at how models do factual recall.
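To make the "route, then look up" picture concrete, here is a toy sketch in Python. It is not the method from the work mentioned above, and nothing in it comes from a real model: the one-hot token features, the hand-picked keys and query, the 0.3 threshold, and the `lookup_fact` helper are all assumptions made up for illustration. It only shows how a single attention step can copy information about earlier tokens into the final position, where a later step can read it off.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, keys, values):
    # Single-head attention for one query position.
    # Returns the attention weights and the weighted mix of value vectors.
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    mixed = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, mixed

# Hypothetical one-hot "token identity" features for: the, Eiffel, Tower.
tokens = ["the", "Eiffel", "Tower"]
values = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# Hand-picked keys (an assumption): the subject tokens get large keys,
# so the query at the final position attends mostly to them.
keys = [[0.0], [4.0], [4.0]]
query_at_final = [1.0]

# Step 1: attention routes the subject's tokens into the final position.
weights, residual_at_final = attend(query_at_final, keys, values)

def lookup_fact(residual):
    # Step 2 (toy stand-in for a later lookup): fire only if the final
    # position now carries both the "Eiffel" and "Tower" features.
    has_eiffel = residual[1] > 0.3
    has_tower = residual[2] > 0.3
    return "Paris" if has_eiffel and has_tower else None

answer = lookup_fact(residual_at_final)
print(dict(zip(tokens, weights)), answer)
```

In this sketch the lookup succeeds only because attention has already gathered both subject tokens into one position, which is the ordering the paragraph above describes.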
