How to Prune a Model?

In the oreginal model, outpots of which and multified attention are concacinated. And for each of the model, the most important had on the first layers is etentient red. So intuitively, why do we want to many miles the probably tatits open? So if the dunde, the model doesn't have to ruin some heads, really, it wants to do nothing. If i don't say it's explicitly model we want to remove some heads or from you it doesn't has to do it. Wik wecan push, more or less, to switchless important heads.

Play episode from 09:36

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app