I see. Oh, I think I mischaracterized it a bit. So the inconsistency is basically this: if you plot both of these straight, nearly parallel lines, loss as a function of dataset size and loss as a function of compute, they have slightly different slopes. At some point, one of the slopes has to be the lower one, which is compute, so the compute line will have to maybe overtake the data line. So that is interesting, as you say. I think there's not too much research in this vein. There is starting to be more of it, and you do cite a lot of relevant literature, but these kinds of macroscopic or microscopic trends that are empirical but seem to hold are definitely interesting to be
