"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

The Data Factory: Inside the $100B Race for Post-Training Supremacy, with Labelbox CEO Manu Sharma

216 snips

Jul 8, 2025

In this engaging discussion, Manu Sharma, Founder and CEO of Labelbox—known for providing cutting-edge training data to AI labs—explores the evolution of AI training methods. He highlights the shift from simple labeling to sophisticated reinforcement learning environments. Sharma reveals how AI labs are investing massively in training data and discusses the nuances of post-training strategies. He shares insights on the competitive AI landscape, the importance of human data, and the interplay of creativity and AI in modern industries.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

INSIGHT

Billions Spent on Specialized AI Data

Frontier AI labs now spend over a billion dollars annually on specialized training data for advanced tasks.
The shift from supervised to reinforcement learning reflects the growing complexity and specialization of AI training data.

INSIGHT

Post-training Emphasizes Reinforcement Learning

Post-training budgets increasingly focus on reinforcement learning for skill-specific tasks like coding and math.
Models are tested with verifiable rewards, which enables faster improvement in reasoning and coding.

INSIGHT

Human Data Anchors AI Alignment

Human expert data anchors AI alignment by providing quality judgments where right answers aren't known.
Reinforcement learning setups increasingly use graders and rubrics instead of step-by-step human reasoning traces.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

Manu Sharma, founder and CEO of Labelbox, explains how frontier AI training data has evolved far beyond simple labeling to sophisticated reinforcement learning environments where domain experts create "gyms" for models to develop complex skills. With every Western frontier lab now spending over a billion dollars annually on training data, the conversation traces the shift from supervised learning to reinforcement learning from verifiable rewards, particularly for coding, mathematical reasoning, and computer use. Sharma reveals how Labelbox operates as a vertically integrated data factory, conducting over 2,000 AI-powered expert interviews daily and paying top specialists more than $250,000 annually. The discussion provides essential insights into the red-hot training data market that's reshaping AI development following major deals like Meta's $15B acquisition of Scale AI.

Sponsors:

Oracle Cloud Infrastructure:

Oracle Cloud Infrastructure (OCI) is the next-generation cloud that delivers better performance, faster speeds, and significantly lower costs, including up to 50% less for compute, 70% for storage, and 80% for networking. Run any workload, from infrastructure to AI, in a high-availability environment and try OCI for free with zero commitment at https://oracle.com/cognitive

The AGNTCY:

The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at https://agntcy.org

NetSuite by Oracle:

NetSuite by Oracle is the AI-powered business management suite trusted by over 42,000 businesses, offering a unified platform for accounting, financial management, inventory, and HR. Gain total visibility and control to make quick decisions and automate everyday tasks—download the free ebook, Navigating Global Trade: Three Insights for Leaders, at https://netsuite.com/cognitive

PRODUCED BY:

https://aipodcast.ing

SOCIAL LINKS:

Website: https://www.cognitiverevolution.ai

Twitter (Podcast): https://x.com/cogrev_podcast

Twitter (Nathan): https://x.com/labenz

LinkedIn: https://linkedin.com/in/nathanlabenz/

Youtube: https://youtube.com/@CognitiveRevolutionPodcast

Apple: https://podcasts.apple.com/de/podcast/the-cognitive-revolution-ai-builders-researchers-and/id1669813431

Spotify: https://open.spotify.com/show/6yHyok3M3BjqzR0VB5MSyk