Yannic Kilcher Videos (Audio Only) cover image

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Yannic Kilcher Videos (Audio Only)

00:00

Is There a Difference Between the Foot Forward Layer and the MLP Layer?

I think these fan out fan in feet forward layers are really sponges for information. They can absorb a huge amount of basically memorized information. I do think there's sort of one of the unsung heroes of these big transformer networks, these huge massive high capacity memories. Some of the newer transformers, they add some gating to these MLP layers to increase the capacity even further.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app