
Technically Legal - A Legal Technology and Innovation Podcast Benchmarking Legal AI: Measuring the Delta Between Man and Machine (Anna Guo Legalbenchmarks.ai)
Oct 23, 2025
Join Anna Guo, a former BigLaw lawyer turned entrepreneur and founder of Legalbenchmarks.ai, as she explores the intriguing world of legal AI tools. Discover her groundbreaking research on measuring the 'delta' between human performance and AI capabilities. Anna discusses how specialized AI often outshines general tools in relevance and coherence for legal tasks. She reveals surprising findings about AI's effectiveness in tasks like contract drafting and highlights when to lean on technology versus human judgment. Get ready to rethink the future of law!
AI Snips
Chapters
Transcript
Episode notes
From Informal Tests To A Public Project
- Anna started by informally advising friends about new legal AI tools and testing them personally.
- A LinkedIn post asking for collaborators exploded in interest and launched LegalBenchmarks.ai.
Measuring The 'Delta' Between Human And AI
- LegalBenchmarks.ai measures the 'delta' between human performance and how much AI elevates humans.
- The goal is to quantify where humans plus AI truly outperform humans alone or AI alone.
Assess Beyond Accuracy
- Evaluate tools across reliability, usability, and platform workflow support, not just accuracy.
- Test legal adequacy, helpfulness, and integrations like Word or citation checks to capture full utility.
