Dev Interrupted

Why enterprise AI lives or dies on applied research | Contextual AI’s Elizabeth Lingg

Sep 16, 2025
In this discussion, Elizabeth Lingg, Director of Applied Research at Contextual AI, shares insights from her esteemed career at Microsoft and Apple. She explores the challenges of turning AI research into reliable products and emphasizes the importance of correlating accuracy with customer satisfaction. Elizabeth highlights the necessity of specialized AI tailored to unique business needs and advocates for a collaborative approach between research and engineering teams. Her expert advice on measuring AI impact through diverse metrics provides a roadmap for effective enterprise AI integration.
AI Snips
ANECDOTE

Metrics Can Drive The Wrong Behavior

  • Two engineering managers showed up at a conference with every AI tool and a dashboard, yet felt stuck on measuring real impact.
  • Their executives obsessed over PR throughput, creating pressure to game the metric rather than genuinely improve quality for developers.
INSIGHT

Why Models Hallucinate

  • OpenAI's paper argues hallucinations stem from model incentives and post-training evaluation pressure.
  • Models prefer making confident guesses over saying "I don't know" because evals penalize blank answers.
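The incentive problem can be seen with a toy expected-score calculation (an illustration of the argument, not code from the paper; the probability value is an assumption):

```python
# Toy illustration: under accuracy-only grading, "I don't know" always
# scores 0, while a guess with any nonzero chance of being right has
# positive expected score -- so a model optimized against such an eval
# learns to guess confidently rather than abstain.
p_correct = 0.2  # assumed chance that a guess happens to be right

# Accuracy-only scoring: 1 point if right, 0 if wrong, no penalty.
score_guess = p_correct * 1 + (1 - p_correct) * 0
score_abstain = 0.0  # a blank / "I don't know" answer earns nothing

print(score_guess > score_abstain)  # → True: guessing dominates abstaining
```

Any eval that only rewards correct answers and never penalizes wrong ones makes this inequality hold for every p_correct > 0.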
ADVICE

Link Internal Metrics To Customer Outcomes

  • Correlate inner-loop metrics (accuracy, recall) with outer-loop metrics (usage, satisfaction) to measure real impact.
  • Use regression and multiple metrics to determine which model features drive customer outcomes.
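A minimal sketch of that correlation step, using ordinary least squares over release-level data (all numbers are synthetic and the metric names are illustrative, not from the episode):

```python
# Relate inner-loop model metrics (accuracy, recall) to an outer-loop
# customer metric (satisfaction) via least squares and correlation.
import numpy as np

# One row per model release: [accuracy, recall] -- synthetic data
inner = np.array([
    [0.81, 0.70],
    [0.84, 0.72],
    [0.86, 0.78],
    [0.90, 0.77],
    [0.92, 0.83],
])
# Outer-loop metric per release, e.g. mean satisfaction score (1-5)
satisfaction = np.array([3.1, 3.4, 3.9, 4.0, 4.4])

# Add an intercept column and fit: satisfaction ~ X @ coefs
X = np.column_stack([np.ones(len(inner)), inner])
coefs, _, _, _ = np.linalg.lstsq(X, satisfaction, rcond=None)
intercept, w_accuracy, w_recall = coefs

# Pairwise correlation shows which inner-loop metric tracks the outcome
corr_accuracy = np.corrcoef(inner[:, 0], satisfaction)[0, 1]
corr_recall = np.corrcoef(inner[:, 1], satisfaction)[0, 1]

print(f"accuracy weight: {w_accuracy:.2f}, recall weight: {w_recall:.2f}")
print(f"corr(accuracy, satisfaction): {corr_accuracy:.2f}")
print(f"corr(recall, satisfaction): {corr_recall:.2f}")
```

With more releases (and more metrics), the same regression generalizes to multiple inner-loop features, which is the "regression and multiple metrics" approach the advice describes.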