Yannic Kilcher Videos (Audio Only) cover image

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Yannic Kilcher Videos (Audio Only)

00:00

The MLPs, the MLP Layers, Are Really Simple.

The hypothesis is that the MLPs may be storing this factual knowledge. There's nothing inherent in the words space needle, where it would make sense to predict Seattle. And and the theory is that the association between that word space needle and the location of Seattle is specifically stored in these MLP layers in the middle of the network. But as soon as we import the MLP connections or states, I'd rather want to say the modules, the MLP modules, remember here we're talking about forward signals not weights.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app