
Generative AI Models Are Sucking Up Data from All Over the Internet, Yours Included
Science Quickly
00:00
Introduction
Discussion on the challenges and concerns regarding the use of copyrighted data to train AI models, including lawsuits filed by writers and artists. Also explores how AI companies obtain data through webcrawlers and web scrapers, highlighting the decreasing transparency of big tech companies and the impact of OpenAI's release of GPT-3.5 and GPT-4.
Transcript
Play full episode