
The Fight Against AI Comes to a Foundational Data Set
Security, Spoken
00:00
The Conflict Between Common Crawl and Publishers Over AI Training Data
The chapter delves into the challenges faced by Common Crawl as publishers, mainly media outlets in Denmark, are requesting the removal of their articles from datasets over copyright concerns related to AI usage. Common Crawl decides to comply with the demands to avoid legal disputes with media companies and copyright owners.
Transcript
Play full episode