
[HUMAN VOICE] "How useful is mechanistic interpretability?" by ryan_greenblatt, Neel Nanda, Buck, habryka
LessWrong (Curated & Popular)
00:00
Debunking the Usefulness of Mechanistic Interpretability
In this chapter, the speakers discuss where they disagree with a previous post about the usefulness of mechanistic interpretability, and argue that some people should still worry about usefulness or theory of change. They also examine induction heads and French neurons, debating whether these phenomena really exist in our world or whether alternative explanations account for the evidence.