
Practically Intelligent

E15: Unlocking the Internet's Treasure with Rich Skrenta

Aug 19, 2024
39:53

In this episode of Practically Intelligent, Sinan and Akshay sit down with Rich Skrenta, the Executive Director of the Common Crawl Foundation. Rich shares his extensive experience in data aggregation and AI and explains how it ties into the history, mission, and future of Common Crawl, a nonprofit organization responsible for one of the largest open-source web data repositories in the world. The three discuss the challenges and opportunities of expanding Common Crawl's global reach, the critical role of curated data in training large language models, and the importance of maintaining open access to the internet in the age of cutting-edge AI.

Key topics include:

  • The importance of curated data in AI training
  • Challenges of expanding Common Crawl globally
  • The future of open internet access in the AI era

