AXRP - the AI X-risk Research Podcast cover image

19 - Mechanistic Interpretability with Neel Nanda

AXRP - the AI X-risk Research Podcast

00:00

Why Do You Think Induction Was the First Thing at Tulum?

induction is just such an important thing for the model to be doing that it is worth its while to figure this out. It's also a pretty distinctive thing to spot if you're looking at an attention pattern. The exact way you embed positional information can matter and is there everything's annoying okay but yeah my guess would basically just be the induction is really useful if you want to predict the next token.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app