Is There a Trade-Off Between Scalability and Performance?
There's some sort of trade-off where you sacrifice maybe a little bit of predictive power because you're using a simpler model, but that's necessary in order to be able to run it at all. The fewer weights, the less representational power the model has, and so potentially you're going to get performance that isn't as good as you would on a larger model. The lowest-precision models are actually binarized neural networks, BNNs, which have one-bit weights. But they can work really well. So all of that comes with trade-offs, and it's about managing the trade-offs and keeping the amount of performance that you need for your application.
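To make the one-bit idea concrete, here is a minimal sketch of weight binarization in plain Python. It is an illustration under assumptions, not something described in the episode: the weight and input values are made up, the sign function is just one common way to binarize, and the storage comparison assumes the full-precision weights would be 32-bit floats. The helper names (`binarize`, `pack_bits`, `dot`) are hypothetical.

```python
def binarize(weights):
    """Map each real-valued weight to +1 or -1 by its sign (zero maps to +1)."""
    return [1 if w >= 0 else -1 for w in weights]


def pack_bits(signs):
    """Pack +1/-1 values into one integer, one bit per weight (bit set = +1)."""
    packed = 0
    for i, s in enumerate(signs):
        if s == 1:
            packed |= 1 << i
    return packed


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


# Hypothetical full-precision weights and inputs, for illustration only.
weights = [0.42, -0.17, 0.05, -0.88, 0.31, -0.02, 0.66, -0.49]
inputs = [1.0, 0.5, -1.5, 2.0, 0.0, -0.5, 1.0, 0.25]

signs = binarize(weights)      # one-bit version of the weights
packed = pack_bits(signs)      # all eight weights fit in a single byte

# Storage cost, assuming 32-bit floats for the original weights.
full_bits = 32 * len(weights)
binary_bits = len(weights)

# The two outputs differ: that gap is the accuracy side of the trade-off.
full_output = dot(weights, inputs)
binary_output = dot(signs, inputs)

print(f"storage: {full_bits} bits vs {binary_bits} bits")
print(f"output:  {full_output:.4f} vs {binary_output:.4f}")
```

The storage drops by a factor of 32 here, while the binarized output only approximates the full-precision one. Whether that approximation is good enough depends on the application, which is the trade-off being described.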