Yannic Kilcher Videos (Audio Only) cover image

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Yannic Kilcher Videos (Audio Only)

00:00

Using Causal Tracing to Determine the Location of the Space Narrow

In a big distributed network, all the states have information that could recover the hidden state. We wanted to test if this is actually true. And what we find is that the MLP corresponds to the early site and then the attention corresponds to the late site. It's not exactly too surprising because the model is going to recall the next fact by outputting the next token. So it's right next to the prediction and the causal impact there isn't too surprising. But what's really interesting is this weird early site that seems at first to be in the middle of nowhere. When we do this kind of experiment averaged over a thousand facts, I think that might be figure two or figure. Yeah

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app