Technically Legal - A Legal Technology and Innovation Podcast

Best of 2025 - Benchmarking Legal AI: Measuring the Delta Between Man and Machine (Anna Guo Legalbenchmarks.ai)

10 snips
Dec 18, 2025
In this engaging discussion, Anna Guo, Founder of LegalBenchmarks.ai and former BigLaw attorney, dives into her groundbreaking research on legal AI tools. She reveals how AI can enhance human legal performance, highlighting stark contrasts between general-purpose and legal-specific AI. Key findings show that while AI can outshine humans in contract drafting, humans and AI together may not always outperform the best AI alone. Anna emphasizes the importance of qualitative usability over mere accuracy, making a compelling case for the future of legal technology.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

From BigLaw To Benchmarking AI

  • Anna Guo left BigLaw, built and sold a B2B marketplace, then returned to legal AI research after colleagues asked for tool evaluations.
  • That early demand and a viral LinkedIn post propelled the creation of LegalBenchmarks.ai to systematically test legal AI.
INSIGHT

Measuring The Delta Between Man And Machine

  • LegalBenchmarks.ai measures the 'delta' between human-only, AI-only, and human-plus-AI to quantify how much AI elevates human performance.
  • The goal is to track that gap over time to see where AI outpaces or complements legal expertise.
ADVICE

Use A Three‑Part Evaluation Framework

  • Evaluate legal AI on three dimensions: reliability, usability, and platform workflow support rather than accuracy alone.
  • Include factual/legal adequacy checks, qualitative usefulness, and integration features like citation checks and Word integration.
Get the Snipd Podcast app to discover more snips from this episode
Get the app