Episode 20: Hattie Zhou, Mila, on supermasks, iterative learning, and fortuitous forgetting

Generally Intelligent

CHAPTER

The Story of Coherent Gradients

It's not clear that high gradient coherence really should lead to better generalization, either. I think the story of coherent gradients is very simple. It makes sense, but it's probably not the full story. But we don't know if that's harmful either. Could there be a way in which it's beneficial? It's possible.
