Future of Life Institute Podcast cover image

Daniela and Dario Amodei on Anthropic

Future of Life Institute Podcast

00:00

Is There a Space for Interpretability Research?

There are other kind of large, large organized efforts that are focused on mechanistic interpretability. Trying to mechanistically map and understand the internal principles inside inside large models is a big field. There's ot less of that has been done in the broderico system than we'd like there to be. We know of folks who are starting to think about it. But i don't want to give a misleading impression here,. Like, people are interested in understanding the particular part of a model that led to a particular output.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app