The Gradient: Perspectives on AI cover image

Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

The Gradient: Perspectives on AI

CHAPTER

The Lottery Ticket Hypothesis

The lottery ticket hypothesis states that within a randomly initialized neural network, there exists a sub-network that when training isolation can achieve the same performance as the full network with all of the weights. So practically speaking, this means that you can kind of prove the network, have a sparse model, and just train the sub-network and still get the same performance. It also provides a perspective on why over-parameterization might be helpful in your networks.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner