Crazy Wisdom

Episode #520: Training Super Intelligence One Simulated Workflow at a Time

Jan 5, 2026
In this engaging discussion, AI practitioner Josh Halliday, who specializes in training advanced models and synthetic data, shares his expertise in reinforcement learning environments. He highlights the evolution of AI training, emphasizing the transition to high-quality, expert-verified data as the main bottleneck in development. Josh also delves into the fascinating uses of gaming engines for AI training, the challenges of data collection, and the philosophical implications of our dependence on AI. He predicts rapid advancements in robotics and emerging job opportunities in AI training.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
00:00 / 00:00

Simulations Fill Dangerous Data Gaps

  • Unity's real-time 3D engines let teams generate synthetic data for dangerous or expensive edge cases.
  • High-fidelity simulated scenarios replace impossible real-world data collection for training vision models.
00:00 / 00:00

RL Teaches By Iterative Failure

  • Reinforcement learning trains agents by trial and error with reward signals.
  • Agents reset after failures and iterate until they reliably solve the task.
00:00 / 00:00

Render Fidelity Matters For Vision Models

  • Ray tracing and path tracing mimic real light paths to increase visual fidelity in simulations.
  • Higher visual realism reduces model confusion in safety-critical vision tasks.
Get the Snipd Podcast app to discover more snips from this episode
Get the app