AXRP - the AI X-risk Research Podcast cover image

21 - Interpretability for Engineers with Stephen Casper

AXRP - the AI X-risk Research Podcast

00:00

The Importance of Interpretability in Engineering Applications

Many people will emphasize the usefulness of interpretability for just kind of making this basic discoveries about networks, understanding them more at a fundamental level. But engineers in the real world benefit from theoretical work or exploratory work all the time as well, even if it's indirect. I think that the lion's share of interpretability research in the AI safety space is kind of focused on basic understanding as opposed to the engineering applications. So I think it's kind of useful to pull closer toward the middle.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app