Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

The Gradient: Perspectives on AI

Is the Basin of Attraction Really Large?

The idea that the basin of attraction could be pretty large is interesting. So if you were to keep, perhaps, the relative magnitudes of the weights you initialize the same, along with the signs, then how far can you go in terms of messing with that?

We just set all the weights to, like, 0.1, and that already worked, at least for the simple MNIST experiments. And in the paper, we ended up setting it to a value that was proportional to the initialization that we still... I don't think it would work.
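To make the idea concrete, here is a minimal sketch of the kind of reset described above: keep the sign of every initial weight and replace its magnitude with one shared constant. The function name `signed_constant`, the layer shape, and the use of the layer's standard deviation as the "proportional to the initialization" scale are assumptions for illustration, not details confirmed in the conversation.

```python
import numpy as np


def signed_constant(weights, magnitude=None):
    """Keep each weight's sign; replace its magnitude with one shared constant."""
    if magnitude is None:
        # Assumption: scale the constant to the layer's own initialization
        # by using the standard deviation of the initial weights.
        magnitude = float(np.std(weights))
    return np.sign(weights) * magnitude


# Hypothetical layer: 784 inputs (MNIST pixels) to 300 hidden units.
rng = np.random.default_rng(0)
w_init = rng.normal(loc=0.0, scale=0.05, size=(784, 300))

# Variant 1: every weight becomes +0.1 or -0.1.
w_fixed = signed_constant(w_init, magnitude=0.1)

# Variant 2: the constant is tied to the scale of the initialization.
w_scaled = signed_constant(w_init)
```

Both variants discard the individual magnitudes entirely, so anything the network retains from its initialization is carried by the signs alone.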
