Generally Intelligent cover image

Episode 20: Hattie Zhou, Mila, on supermasks, iterative learning, and fortuitous forgetting

Generally Intelligent

00:00

Is There a Way to Improve Model Performance?

LSTMs are not known for being amazingly good at composition. We don't know why when you do interaction again, when the models talk to each other and try to do well on the task, you don't just immediately forget about everything you've learned. And so I came across another paper called Knowledge Evolution that proposed this very cool sounding idea of how knowledge within a model evolves towards better generalization. So they basically generate a random binary mask which is the same shape as the model. That's a hyperparameter you can tune, but they fix it after every generation.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app