Hugo Larochelle: Deep Learning as Science

The Gradient: Perspectives on AI

CHAPTER

Scaling in a Second

I do want to dive a little bit into your thoughts on scaling in a second, but just before that, as a last paper to mention here: you worked with my wonderful friend Hattie Zhou on fortuitous forgetting, which I think was a very interesting paper. One insight in this particular work is that good features, the ones that generalize well, are essentially features that are useful in many contexts. One way of simulating many contexts is to have a neural net that trains while its top layers keep being reset, but you never reset the bottom layers.
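
To make the idea of resetting the top layers while keeping the bottom ones a bit more concrete, here is a minimal sketch in plain Python. It is not the procedure from the paper: the tiny two-layer network, the reset schedule, and every name in it (`init_layer`, `reset_top`, `train_with_resets`, `reset_every`) are assumptions made up for illustration. It only shows the general shape of the loop: train, periodically re-initialize the top layer, and never touch the bottom layer except through gradient updates.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Small random weights plus zero biases for one dense layer."""
    scale = 1.0 / math.sqrt(n_in)
    return {
        "w": [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)],
        "b": [0.0] * n_out,
    }


def forward(net, x):
    """Return the hidden activations and the scalar output for input vector x."""
    h = [
        math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(net["bottom"]["w"], net["bottom"]["b"])
    ]
    y = sum(w * hi for w, hi in zip(net["top"]["w"][0], h)) + net["top"]["b"][0]
    return h, y


def sgd_step(net, x, target, lr):
    """One squared-error SGD step on a single example."""
    h, y = forward(net, x)
    err = y - target
    top_w = net["top"]["w"][0]
    # Gradient w.r.t. hidden pre-activations, computed before top_w is updated.
    grad_pre = [err * top_w[j] * (1.0 - h[j] ** 2) for j in range(len(h))]
    for j in range(len(h)):
        top_w[j] -= lr * err * h[j]
    net["top"]["b"][0] -= lr * err
    for j, row in enumerate(net["bottom"]["w"]):
        for i in range(len(row)):
            row[i] -= lr * grad_pre[j] * x[i]
        net["bottom"]["b"][j] -= lr * grad_pre[j]


def reset_top(net, rng):
    """Forget the top layer by re-initializing it; the bottom layer is untouched."""
    hidden = len(net["bottom"]["b"])
    net["top"] = init_layer(hidden, 1, rng)


def train_with_resets(data, hidden=8, epochs=200, reset_every=50, lr=0.05, seed=0):
    """Train a two-layer net, resetting only the top layer every `reset_every` epochs."""
    rng = random.Random(seed)
    n_in = len(data[0][0])
    net = {"bottom": init_layer(n_in, hidden, rng), "top": init_layer(hidden, 1, rng)}
    resets = 0
    for epoch in range(1, epochs + 1):
        for x, target in data:
            sgd_step(net, x, target, lr)
        # Skip the reset on the final epoch so the returned top layer is trained.
        if epoch % reset_every == 0 and epoch < epochs:
            reset_top(net, rng)
            resets += 1
    return net, resets


# Toy regression data (an assumption for the sketch): learn sin(x) on a small grid.
data = [([x / 4.0], math.sin(x / 4.0)) for x in range(-8, 9)]
net, resets = train_with_resets(data)
```

In this sketch each reset forces a freshly initialized top layer to relearn the task on top of the existing bottom-layer features, which is one rough way to read "simulating many contexts": the bottom layer is pushed toward features that stay useful no matter which top layer sits above them.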
