The Fight Against AI Comes to a Foundational Data Set

Security, Spoken

The Conflict Between Common Crawl and Publishers Over AI Training Data

This chapter covers the challenges Common Crawl faces as publishers, mainly media outlets in Denmark, request the removal of their articles from its datasets over copyright concerns about AI training. Common Crawl has decided to comply with the requests to avoid legal disputes with media companies and copyright owners.
