
Shayne Longprey
PhD candidate at MIT and leader of the Data Provenance Initiative, focusing on auditing data used in training AI models.
Best podcasts with Shayne Longprey
Ranked by the Snipd community

9 snips
Jan 22, 2025 • 29min
Will AI Eat Itself?
Julia Kemper, a data scientist at NYU who specializes in AI model outputs, and Shayne Longpre, a PhD candidate at MIT leading the Data Provenance Initiative, discuss the alarming concept of 'model collapse.' They explore how AI's reliance on AI-generated data risks homogenous and bland outputs. Kemper highlights the challenges in improving AI performance under such conditions, while Longpre emphasizes the crucial role of human curation in enhancing AI training data quality. Together, they envision a future where human creativity revitalizes AI’s capabilities.

Oct 25, 2024 • 31min
Tragedy of the (data) commons
Shayne Longprey, an MIT PhD student involved in the Data Provenance Initiative, and Robert Mahari, a researcher at MIT Media Lab and Harvard Law School, delve into key issues surrounding AI data ethics. They discuss the importance of transparency in AI training data and how the decline of publicly available datasets threatens innovation. Their insights from the study "Consent in Crisis" reveal the complexities of data provenance and attribution in generative AI, stressing the need for better consent protocols to safeguard community resources.