Debunking the Usefulness of Mechanistic Interpretability

In this chapter, the speakers discuss where they disagree with a previous post about the usefulness of mechanistic interpretability, and argue that some people should still worry about usefulness or theory of change. They also explore the existence of induction heads and French neurons, debating whether our world really has these phenomena or whether alternative explanations account for them.

Play episode from 06:57

