
Episode 20: Hattie Zhou, Mila, on supermasks, iterative learning, and fortuitous forgetting

Generally Intelligent

CHAPTER

How to Find Sub-Networks That Are the Right Shape of the Solution?

It's not clear exactly why setting it to zero doesn't seem to harm that process. You're still able to find a similar solution. I think Jonathan Frankle also showed in some of the papers that these solutions are linearly connected, so they're all arriving at the same basin in the solution space. Right, so it's almost the same solution you find when you train the model originally, except now you've moved it around a bit so that it's a sparse model.
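To make "linearly connected" concrete, here is a minimal sketch, not taken from the episode or from any paper's code. It walks the straight line between two weight vectors and reports how far the loss rises above the average of the two endpoints. The names `interpolate`, `loss_barrier`, `bowl`, and `double_well` are hypothetical, and the two toy losses stand in for a real network's loss. A barrier near zero is what you would expect when both solutions sit in the same basin.

```python
def interpolate(w_a, w_b, alpha):
    """Point on the straight line from w_a (alpha=0) to w_b (alpha=1)."""
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(w_a, w_b)]


def loss_barrier(loss_fn, w_a, w_b, steps=11):
    """Highest loss along the path minus the mean loss of the two endpoints."""
    alphas = [i / (steps - 1) for i in range(steps)]
    path_losses = [loss_fn(interpolate(w_a, w_b, alpha)) for alpha in alphas]
    endpoint_mean = 0.5 * (path_losses[0] + path_losses[-1])
    return max(path_losses) - endpoint_mean


def bowl(w):
    # Toy loss (assumption): a single basin centered at all-ones.
    return sum((x - 1.0) ** 2 for x in w)


def double_well(w):
    # Toy loss (assumption): two separate minima at w[0] = -1 and w[0] = +1.
    return (w[0] ** 2 - 1.0) ** 2


# Two equally good points in one basin: no rise in loss between them.
same_basin = loss_barrier(bowl, [0.5, 1.0], [1.5, 1.0])

# Two minima in different wells: the loss climbs in the middle of the path.
different_basins = loss_barrier(double_well, [-1.0], [1.0])

print(same_basin, different_basins)
```

With a real model, `loss_fn` would evaluate the network on held-out data using the interpolated weights, and the two endpoints would be, for example, the weights reached by two training runs of the same sparse sub-network.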
