The Collective Podcast cover image

Ep. 235 - Steven Zapata

The Collective Podcast

CHAPTER

Commoncrawl

Commoncrawl is a company that uses AI to scrape the Internet. Lion, another nonprofit based in Germany, then takes what Commoncrawl has made publicly available and repackages it into data sets. It's not just repackaging; they're making it easy for code to interact with it further down the line for AI purposes.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner