ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Yannic Kilcher Videos (Audio Only)

How Can We Modify a Single Layer Neural Network?

The old name for this is a linear associative memory. It goes way back to the 1970s, when people were asking: what can you use a single-layer neural network for? And one of the leading hypotheses was that it just stores key-value associations. So now we ask the question: how can we modify such a network so that it learns a new fact or changes its mind about one of the facts that it knows? Well, the attack surface right here is going to be these MLP modules, namely updating the weights of the MLP modules such that they change their mind about a fact.
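
To make the key-value picture concrete, here is a minimal sketch in plain Python. It treats a single linear layer as a matrix W that maps a key vector k to a value vector W k, and then rewrites one stored association with a rank-one update. The helper names (`matvec`, `rank_one_edit`) and the toy numbers are assumptions made for illustration. The update shown is the plain minimum-norm rank-one correction, a simplification: the ROME paper additionally weights the update by statistics of previously seen keys, which this sketch leaves out.

```python
# Illustrative sketch (not the paper's code): a single linear layer
# viewed as a key-value memory, where the stored value for key k is W k.

def matvec(W, k):
    """Multiply matrix W (a list of rows) by vector k."""
    return [sum(w_ij * k_j for w_ij, k_j in zip(row, k)) for row in W]

def rank_one_edit(W, k_new, v_new):
    """Return W' = W + (v_new - W k_new) k_new^T / (k_new . k_new).

    This is the smallest change (in Frobenius norm) that makes
    W' k_new equal v_new. It is a simplified stand-in for the
    covariance-weighted update described in the ROME paper.
    """
    residual = [v - wk for v, wk in zip(v_new, matvec(W, k_new))]
    norm_sq = sum(x * x for x in k_new)
    return [
        [w_ij + r_i * k_j / norm_sq for w_ij, k_j in zip(row, k_new)]
        for row, r_i in zip(W, residual)
    ]

# Toy memory with 3-dimensional keys and 2-dimensional values.
W = [[1.0, 0.0, 0.0],
     [0.0, 2.0, 0.0]]
k_edit = [1.0, 0.0, 0.0]    # key whose stored value we want to change
k_other = [0.0, 1.0, 0.0]   # unrelated (orthogonal) key
v_target = [5.0, -1.0]      # new value to associate with k_edit

W_edited = rank_one_edit(W, k_edit, v_target)
```

After the edit, `matvec(W_edited, k_edit)` returns the target value, while the value stored for `k_other` is unchanged because that key is orthogonal to the edited one. With keys that are not orthogonal, this plain update would also shift their values, which is one reason a more careful weighting is useful in practice.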
