LessWrong (Curated & Popular)

[Linkpost] “METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman

Mar 19, 2025
This episode covers METR's approach to measuring AI performance by the length of the tasks AI can complete. The headline trend: that length has been doubling roughly every seven months. If the trend holds, within a decade AI could independently manage complex software tasks that usually take humans days or weeks.
INSIGHT

AI Task Length Measurement

  • AI performance is measured by the length of the tasks AI can complete.
  • This metric has grown exponentially and consistently over the past six years.
INSIGHT

Exponential Growth in AI Capabilities

  • The length of tasks AI can complete doubles approximately every seven months.
  • If that trend continues, within a decade AI could handle tasks that currently require days or weeks of human work.
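
The arithmetic behind a fixed doubling time can be sketched in a few lines of Python. This is a minimal illustration, not METR's methodology: the only number taken from the episode is the seven-month doubling time, while the function names and the 1-hour starting task length in the usage example are hypothetical placeholders chosen to make the calculation concrete.

```python
import math

# Doubling time quoted in the episode summary (months).
DOUBLING_MONTHS = 7.0


def growth_factor(months, doubling_months=DOUBLING_MONTHS):
    """Multiplier on task length after `months` of steady doubling."""
    return 2.0 ** (months / doubling_months)


def months_to_reach(start_hours, target_hours, doubling_months=DOUBLING_MONTHS):
    """Months needed for task length to grow from start_hours to target_hours."""
    if start_hours <= 0 or target_hours <= 0:
        raise ValueError("task lengths must be positive")
    return doubling_months * math.log2(target_hours / start_hours)


if __name__ == "__main__":
    # Hypothetical starting point of 1 hour; not a figure from the source.
    start = 1.0
    print(f"Growth over 10 years: x{growth_factor(120):,.0f}")
    print(f"Months from {start:g} h to a 40 h work week: "
          f"{months_to_reach(start, 40.0):.1f}")
```

Ten years is about 17 doublings at this rate, which is why a modest starting task length compounds into days or weeks of human work. The extrapolation assumes the doubling time stays constant, which the trend alone does not guarantee.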