Is There a Space for Interpretability Research?

There are other kind of large, large organized efforts that are focused on mechanistic interpretability. Trying to mechanistically map and understand the internal principles inside inside large models is a big field. There's ot less of that has been done in the broderico system than we'd like there to be. We know of folks who are starting to think about it. But i don't want to give a misleading impression here,. Like, people are interested in understanding the particular part of a model that led to a particular output.

Transcript

Play full episode

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app