The Inside View cover image

David Bau on Editing Facts in GPT, AI Safety and Interpretability

The Inside View

00:00

The Importance of Interpretability in Machine Learning

We're permanently in an out of domain era in machine learning. We have a really good handle on how to control the behavior of a model when you're deploying it in domain. But when you're out of domain, we know that these models can do unexpected things. And so I think that one of the big issues with out of domain behavior is the potential that you might have models that end up being unsafe.

Play episode from 02:27
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app