Latent Space: The AI Engineer Podcast

Better Data is All You Need — Ari Morcos, Datology

Aug 29, 2025
Ari Morcos, CEO of Datology and former research scientist at Google and Meta, discusses the vital but overlooked area of data curation in AI. He emphasizes that high-quality data is essential for developing efficient models, arguing that the focus should shift from scaling models to improving data quality. Morcos recounts his journey from neuroscience to understanding that as data scales, the importance of model architecture diminishes. He advocates for automating data curation to enable organizations to harness the full potential of AI effectively.

Models Are What They Eat

  • Ari Morcos's phrase "models are what they eat" captures the core insight that training data determines model quality.
  • Automating data curation at scale is essential to train faster, better, and smaller models.

From Neuroscience To Deep Learning Science

  • Ari Morcos described his path from neuroscience to ML and wanting a science of deep learning.
  • He ran experiments to understand learned representations but found it hard to turn those insights into changes that reliably improved models.

Data Overtakes Inductive Bias At Scale

  • Several papers convinced Ari that at scale inductive biases matter less and data dominates.
  • As datasets grow, what a model learns from the data outweighs the priors built into its architecture.
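
The claim that data outweighs priors at scale has a simple Bayesian analogue, sketched below as a toy illustration. It is not a method from the episode. Two observers start with opposite Beta priors about a coin's heads rate (standing in, loosely, for different architectural biases) and see the same flips. The prior strengths, the true rate, and the function names are all assumptions chosen for the example.

```python
def posterior_mean(prior_heads: float, prior_tails: float,
                   heads: int, flips: int) -> float:
    """Posterior mean of the heads rate under a Beta(prior_heads, prior_tails) prior."""
    return (prior_heads + heads) / (prior_heads + prior_tails + flips)


TRUE_RATE = 0.7  # assumed data-generating rate for this toy example


def gap_between_priors(flips: int) -> float:
    """Disagreement between two opposite priors after seeing the same data."""
    heads = round(TRUE_RATE * flips)
    skeptical = posterior_mean(20.0, 80.0, heads, flips)   # prior belief near 0.2
    optimistic = posterior_mean(80.0, 20.0, heads, flips)  # prior belief near 0.8
    return abs(optimistic - skeptical)


if __name__ == "__main__":
    for n in (0, 100, 10_000, 1_000_000):
        print(f"{n:>9} flips: gap = {gap_between_priors(n):.5f}")
```

With these priors the gap works out to 60 / (100 + n), so it starts at 0.6 with no data and shrinks toward zero as the number of flips grows: the starting assumptions matter less and less once there is enough data.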