Yannic Kilcher Videos (Audio Only) cover image

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Yannic Kilcher Videos (Audio Only)

00:00

Interpretability and the Practical Site

There's been a series of papers from Jiva, from Israel, who have been looking at the structure of computations inside the network. And so our paper is another contribution in this direction. We're really focusing on using causal probes to ask that question and see how the network responds when we make changes. The interpretability research is always a bit shrouded in mystery because there are something like 10,000 different explanations that could explain a given fact.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app