Y Combinator Startup Podcast

How Intelligent Is AI, Really?

Dec 17, 2025
Greg Kamradt, President of the ARC Prize Foundation, discusses how to measure intelligence in AI systems. He critiques standard benchmarks for focusing on scale rather than learning, and explains how ARC-AGI challenges reveal models' reasoning capabilities, noting the jump in scores from older models to newer ones. He previews an upcoming interactive benchmark in which AI must infer goals without instructions. The conversation covers the complexities of measuring true intelligence and the implications for future AGI development.
INSIGHT

Intelligence As Efficient Learning

  • ARC defines intelligence as the ability to learn new things more efficiently rather than raw task performance.
  • ARC tests this ability to generalize with problems that humans can solve but models historically could not.
ANECDOTE

Models Jumped From Near-Zero To Noticeable Gains

  • Early LLMs scored around 4–5% on the original ARC benchmark while humans solved the tasks.
  • A later model jumped to ~21%, showing a rapid shift once reasoning capabilities improved.
INSIGHT

Public Scores Can Be Misleading

  • Big labs reporting ARC scores helps visibility but can create vanity metrics disconnected from the mission.
  • ARC's core mission remains pulling forward open progress toward human-like generalization.