David Bau on Editing Facts in GPT, AI Safety and Interpretability

The Inside View

The Importance of Interpretability in Machine Learning

We're permanently in an out-of-domain era in machine learning. We have a really good handle on how to control the behavior of a model when you're deploying it in domain. But when you're out of domain, we know that these models can do unexpected things. And so I think that one of the big issues with out-of-domain behavior is the potential for models that end up being unsafe.
