The Importance of Interpretability in Engineering Applications

Many people will emphasize the usefulness of interpretability for just kind of making this basic discoveries about networks, understanding them more at a fundamental level. But engineers in the real world benefit from theoretical work or exploratory work all the time as well, even if it's indirect. I think that the lion's share of interpretability research in the AI safety space is kind of focused on basic understanding as opposed to the engineering applications. So I think it's kind of useful to pull closer toward the middle.

Play episode from 03:49

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app