Future of Life Institute Podcast cover image

Daniela and Dario Amodei on Anthropic

Future of Life Institute Podcast

00:00

Is There a Space for Interpretability Research?

There are other kind of large, large organized efforts that are focused on mechanistic interpretability. Trying to mechanistically map and understand the internal principles inside inside large models is a big field. There's ot less of that has been done in the broderico system than we'd like there to be. We know of folks who are starting to think about it. But i don't want to give a misleading impression here,. Like, people are interested in understanding the particular part of a model that led to a particular output.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner