Hattie Zhou: Lottery Tickets and Algorithmic Reasoning in LLMs

The Gradient: Perspectives on AI

Is the Basin of Attraction Really Large?

The idea that the basin of attraction could be pretty large is interesting. So if you were to keep the relative magnitudes of the weights you initialize the same, along with the signs, then how far can you go in terms of messing with that? We just set all the weights to like 0.1, and that already worked, at least for the simple MNIST experiments. And in the paper, we ended up setting it to a value that was proportional to the initialization that we still, I don't think it would work.
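
To make the idea in this snippet concrete, here is a minimal sketch of a sign-preserving constant reinitialization: every weight keeps the sign it had at initialization, and its magnitude is replaced by a single constant. The function name `signed_constant_reinit`, the toy weight list, and the use of the layer's standard deviation as the "proportional to the initialization" constant are all assumptions made for illustration; the snippet itself only mentions a fixed value of 0.1 and a value proportional to the initialization.

```python
import random
import statistics


def signed_constant_reinit(init_weights, constant=0.1):
    """Keep only the sign of each initial weight; replace its magnitude with `constant`."""
    out = []
    for w in init_weights:
        if w > 0:
            out.append(constant)
        elif w < 0:
            out.append(-constant)
        else:
            out.append(0.0)  # an exactly-zero weight has no sign to keep
    return out


random.seed(0)
# Hypothetical initial weights for one layer, flattened to a list.
w_init = [random.gauss(0.0, 0.05) for _ in range(12)]

# Variant 1: the same fixed constant everywhere (0.1, as mentioned in the episode).
w_fixed = signed_constant_reinit(w_init, constant=0.1)

# Variant 2: a constant proportional to the initialization scale.
# Using the layer's standard deviation is an assumption made for this sketch.
layer_scale = statistics.pstdev(w_init)
w_scaled = signed_constant_reinit(w_init, constant=layer_scale)
```

In both variants the pattern of signs is identical to the original initialization; only the magnitudes are discarded, which is the kind of perturbation the question about the basin of attraction is probing.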
