3min chapter

Exhaustion of High-Quality Data Could Slow Down AI Progress in Coming Decades

The Data Exchange with Ben Lorica

CHAPTER

The Key Findings for NLP

If current trends continue, we will probably run out of data between 2030 and 2050, with a median date around 2040. If you include other languages, the result should be more or less similar. To measure the amount of data for NLP, at that point you're literally just measuring volume, so there's no notion of quality.
