AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Long Will Expanding Scaling Hold Up?
The most exciting prediction of this theory is not that you can prune x percent of your data but a qualitative one which is that the scaling the power loss scaling we're used to can be qualitatively beaten and achieve exponential scaling. The amount of compute you can save by using one of these other strategies only increases as the size of your datasets grows. There's an important question of how long this exponential scaling will hold up, i think we'll only know the answer once we go and actually do these experiments.