[HUMAN VOICE] "How useful is mechanistic interpretability?" by ryan_greenblatt, Neel Nanda, Buck, habryka

LessWrong (Curated & Popular)

CHAPTER

Debunking the Usefulness of Mechanistic Interpretability

In this chapter, the speakers discuss their disagreements with a previous post about the usefulness of mechanistic interpretability and argue that some people should still worry about usefulness or theory of change. They also discuss induction heads and French neurons, debating whether our world really has these phenomena or whether alternative explanations exist.
