The Difference Between Seven and 70 Billion Parameter Models
Seven billion to 70 billion is an order-of-magnitude jump. Why would you have something fairly close to that at 13 billion parameters? Like, what's the difference between seven and 13 when the next step is all the way up to 70? What's the rationale, you think?

Yeah, so it is interesting, actually. If I'm understanding right from some of the sources that I've been reading, there was actually a 30 or 34 billion model that they also had in prerelease and were tuning. So there was another one that kind of fit in that slot, filling the gap you're talking about.