CXOTalk

Top Data Scientists Explain Bad Data, Poisoned Datasets, and Other AI Killers | CXOTalk #896

44 snips
Oct 9, 2025
Join Dr. David Bray, a tech policy expert at the Stimson Center, and Dr. Anthony Scriffignano, a data science leader, as they dive into the hidden threats of bad data and poisoned datasets in AI. They discuss the Five Ms framework for identifying AI failures and why organizations often rush into AI adoption without proper vetting. Learn about the risks of generative AI, the importance of critical thinking and ethical oversight, and how to recognize malicious data campaigns that can undermine your AI systems.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

AI Needs Trustworthy Foundations

  • Generative AI depends on training data as its foundation and that foundation can be unstable.
  • If you ignore data provenance or missingness, your model may collapse like a castle on quicksand.
ADVICE

Start With Problem, Not Tool

  • Do not lead with tools; start with a clear problem and mission before using AI.
  • Ask how AI produces net-new value and whether the chosen tool fits the problem.
ANECDOTE

Real Errors From Bad Training Data

  • David gave an example where a model incorrectly reported Georgia's population as ~350 million due to a training typo.
  • He also recounted policy deployments that placed COVID resources by residence instead of workplace and misread demand.
Get the Snipd Podcast app to discover more snips from this episode
Get the app