Hugo Larochelle: Deep Learning as Science

The Gradient: Perspectives on AI

Scaling in a Second

I do want to dive a little bit into your thoughts on scaling in a second, but just before that, as a last paper to mention here: you worked with my wonderful friend Hattie Zhou on fortuitous forgetting, which I think was a very interesting paper. One insight in this particular work is that, maybe, good features that generalize well are essentially features that are useful in many contexts. One way of simulating many contexts is to train a neural net and keep resetting the top layers, while never resetting the bottom layers.
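
The resetting idea described above can be sketched in a few lines of plain Python. This is only an illustration of the general scheme as stated in the conversation, not the procedure from the paper: the helper names (`init_layer`, `reset_top_layers`, `train_with_resets`), the uniform initialization, the reset schedule, and the `train_step` callback are all assumptions made for the example.

```python
import random

def init_layer(n_in, n_out, rng):
    """Return a freshly initialized weight matrix as a list of rows (assumed init scheme)."""
    scale = 1.0 / n_in ** 0.5
    return [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]

def reset_top_layers(layers, sizes, n_keep, rng):
    """Re-initialize every layer above the first n_keep; the bottom layers are left untouched."""
    for i in range(n_keep, len(layers)):
        layers[i] = init_layer(sizes[i], sizes[i + 1], rng)

def train_with_resets(layers, sizes, n_keep, n_rounds, steps_per_round, train_step, rng):
    """Alternate rounds of training with resets of the top layers.

    train_step is a hypothetical callback that applies one in-place update to `layers`.
    """
    for round_idx in range(n_rounds):
        for _ in range(steps_per_round):
            train_step(layers)
        # Skip the reset after the last round so the final top layers stay trained.
        if round_idx < n_rounds - 1:
            reset_top_layers(layers, sizes, n_keep, rng)
    return layers
```

Because the bottom layers persist across rounds while each new set of top layers starts from scratch, the bottom features have to be useful to many different top-layer initializations, which is the "many contexts" intuition from the discussion.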
