LessWrong (Curated & Popular) cover image

[HUMAN VOICE] "How useful is mechanistic interpretability?" by ryan_greenblatt, Neel Nanda, Buck, habryka

LessWrong (Curated & Popular)

00:00

Debunking the Usefulness of Mechanistic Interpretability

In this chapter, the speakers discuss their disagreements with a previous post about the usefulness of mechanistic interpretability and argue that some people should still worry about usefulness or theory of change. They also explore the existence of induction heads and French neurons, debating whether our world has these phenomena or if alternative explanations exist.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app