Technically Legal - A Legal Technology and Innovation Podcast

Benchmarking Legal AI: Measuring the Delta Between Man and Machine (Anna Guo, Legalbenchmarks.ai)

Oct 23, 2025
Join Anna Guo, a former BigLaw lawyer turned entrepreneur and founder of Legalbenchmarks.ai, as she explores the intriguing world of legal AI tools. Discover her groundbreaking research on measuring the 'delta' between human performance and AI capabilities. Anna discusses how specialized AI often outshines general tools in relevance and coherence for legal tasks. She reveals surprising findings about AI's effectiveness in tasks like contract drafting and highlights when to lean on technology versus human judgment. Get ready to rethink the future of law!
ANECDOTE

From Informal Tests To A Public Project

  • Anna started by informally advising friends about new legal AI tools and testing them personally.
  • A LinkedIn post asking for collaborators exploded in interest and launched LegalBenchmarks.ai.
INSIGHT

Measuring The 'Delta' Between Human And AI

  • LegalBenchmarks.ai measures the 'delta' between what humans achieve alone and what they achieve when elevated by AI.
  • The goal is to quantify where humans plus AI truly outperform humans alone or AI alone.
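One way to picture the 'delta' is as a difference in average scores on the same set of tasks. The sketch below is only an illustration under stated assumptions: the 0-10 scale, the sample scores, and the names `delta` and `summarize` are hypothetical and are not taken from the episode or from LegalBenchmarks.ai's actual methodology.

```python
from statistics import mean


def delta(baseline_scores, assisted_scores):
    """Return the mean improvement of assisted scores over baseline scores."""
    return mean(assisted_scores) - mean(baseline_scores)


def summarize(human, ai, combined):
    """Compare human-plus-AI results against each baseline on the same tasks."""
    vs_human = delta(human, combined)
    vs_ai = delta(ai, combined)
    return {
        "delta_vs_human": vs_human,
        "delta_vs_ai": vs_ai,
        # True only when the combination beats both baselines on average.
        "combined_wins": vs_human > 0 and vs_ai > 0,
    }


# Hypothetical reviewer scores (0-10) for three tasks; illustrative only.
human_alone = [6, 7, 5]
ai_alone = [7, 6, 6]
human_plus_ai = [8, 8, 7]

result = summarize(human_alone, ai_alone, human_plus_ai)
print(result)
```

A positive value on both deltas would indicate that humans plus AI outperformed each baseline on these tasks; a real benchmark would also need enough tasks and reviewers to make the comparison meaningful.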
ADVICE

Assess Beyond Accuracy

  • Evaluate tools across reliability, usability, and platform workflow support, not just accuracy.
  • Test legal adequacy, helpfulness, and integrations like Word or citation checks to capture full utility.
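Scoring a tool on several criteria rather than accuracy alone can be sketched as a simple weighted scorecard. The criteria names follow the snip above, but the weights, the 0-10 scale, the sample scores, and the function `overall_score` are assumptions made for illustration, not a rubric described in the episode.

```python
def overall_score(scores, weights):
    """Return the weighted average of per-criterion scores."""
    total_weight = sum(weights.values())
    return sum(scores[name] * w for name, w in weights.items()) / total_weight


# Hypothetical equal weights; a real evaluation would choose its own.
weights = {
    "accuracy": 1,
    "reliability": 1,
    "usability": 1,
    "workflow_support": 1,
}

# Hypothetical scores (0-10) for one tool; illustrative only.
tool_scores = {
    "accuracy": 9,
    "reliability": 6,
    "usability": 7,
    "workflow_support": 6,
}

print(overall_score(tool_scores, weights))
```

A tool that looks strong on accuracy alone can end up with a noticeably lower overall score once the other criteria are counted, which is the point of assessing beyond accuracy.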