Shahul, co-creator of the open-source Ragas project, joins the discussion on metrics-driven development for LLM applications. He sheds light on the critical differences between evaluating models and evaluating the applications built on them, emphasizing the need for tailored assessments. The conversation delves into the role of synthetic test data, the promise of improved evaluation standards, and the future possibilities of LLM applications powered by tool use and better metrics.
Quick takeaways
Ragas streamlines the evaluation of LLM applications by automating techniques that capture their effectiveness in real-world scenarios.
Metrics-driven development is essential for developers as it quantifies application performance, simplifying debugging and allowing informed modifications to LLM applications.
Deep dives
Introduction to Ragas and Its Purpose
Ragas is an open-source library designed to help developers and engineers evaluate large language model (LLM) applications efficiently. Its creators, Shahul and Jithin, recognized that manual evaluation of these applications is tedious and inefficient, often leading to inaccurate results. They set out to streamline the evaluation process by automating techniques that capture the effectiveness of LLMs in real-world applications. By focusing on essential tools and workflows, Ragas aims to save engineers valuable time while producing reliable evaluations.
Understanding LLM Application Evaluation vs. Model Evaluation
Evaluating LLM applications differs significantly from traditional model evaluation, primarily because of end-users' perspectives and the specific objectives an application must fulfill. While model evaluation focuses on general capabilities and benchmarks, application evaluation is tailored to a specific use case and the data it will interact with. Ragas gives application builders tools that simplify this evaluation without requiring a deep dive into machine learning internals. This shift establishes a more intuitive approach, allowing software engineers who may not have an ML background to effectively assess their applications.
The Spectrum of Testing Paradigms
In integrating AI into software applications, developers must adapt their testing strategies from traditional unit tests to a more nuanced evaluation that accounts for the non-deterministic nature of AI outputs. Unlike conventional code, where a given input produces a predictable output, LLMs can generate varied responses to the same input, which complicates testing. Developers need to move from a discrete, exact-match testing mindset to one that embraces a continuous range of acceptable outputs, acknowledging that responses may differ significantly in wording yet still be valid. This shift lets developers evaluate their applications on contextual relevance rather than exact matching.
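To make the contrast concrete, here is a minimal sketch (not from the episode) of moving from an exact-match assertion to a similarity-threshold check. The sentence-transformers model and the 0.8 threshold are illustrative choices, not anything Ragas prescribes:

```python
# A traditional unit test expects one exact output, which is brittle for LLMs:
#   assert summarize(doc) == "Expected summary."
# Instead, score semantic closeness to a reference answer.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

def semantically_close(response: str, reference: str, threshold: float = 0.8) -> bool:
    """Pass if the response is close enough in meaning, not identical in wording."""
    emb = model.encode([response, reference])
    return util.cos_sim(emb[0], emb[1]).item() >= threshold

# Two differently worded but equivalent answers can both pass:
assert semantically_close(
    "Paris is the capital of France.",
    "France's capital city is Paris.",
)
```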
Metrics-Driven Development in LLM Applications
Metrics-driven development involves quantifying application performance to make informed decisions about modifications and improvements, akin to test-driven development. This approach becomes crucial for developers by enabling them to set benchmarks for their LLM applications, which can drastically simplify debugging and optimization efforts. Ragas provides a framework where developers can define specific performance metrics tailored to their applications, thereby streamlining the evaluation process. By generating clear, actionable insights based on these metrics, Ragas fosters a more efficient development cycle that helps teams verify changes and maintain application quality with minimal overhead.
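As a rough sketch of this workflow, the ragas library exposes an evaluate entry point over a dataset of questions, retrieved contexts, answers, and reference answers. The imports and column names below follow the v0.1-era documentation and may differ in newer releases:

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy, context_precision

# One evaluation row: the question asked, the retrieved contexts,
# the generated answer, and a reference ("ground truth") answer.
rows = {
    "question": ["What is Ragas?"],
    "contexts": [["Ragas is an open-source evaluation library for LLM apps."]],
    "answer": ["Ragas is an open-source library for evaluating LLM applications."],
    "ground_truth": ["Ragas is an open-source project for evaluating LLM applications."],
}

results = evaluate(
    Dataset.from_dict(rows),
    metrics=[faithfulness, answer_relevancy, context_precision],
)
print(results)  # per-metric scores, e.g. {'faithfulness': 0.95, ...}
```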
How do you systematically measure, optimize, and improve the performance of LLM applications (like those powered by RAG or tool use)? Ragas is an open source effort that has been trying to answer this question comprehensively, and they are promoting a “Metrics Driven Development” approach. Shahul from Ragas joins us to discuss Ragas in this episode, and we dig into specific metrics, the difference between benchmarking models and evaluating LLM apps, generating synthetic test data and more.
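On the synthetic test data point, ragas has shipped a test set generator roughly along these lines. The construction helper and question-type distribution names follow the v0.1-era docs, so treat this as a sketch rather than the current API, and note that "docs/" is a placeholder path:

```python
from langchain_community.document_loaders import DirectoryLoader
from ragas.testset.generator import TestsetGenerator
from ragas.testset.evolutions import simple, reasoning, multi_context

# Load the documents your RAG app retrieves over ("docs/" is a placeholder).
documents = DirectoryLoader("docs/").load()

# Build a generator backed by OpenAI models (convenience helper from v0.1 docs).
generator = TestsetGenerator.with_openai()

# Synthesize question/ground-truth pairs with a mix of question styles.
testset = generator.generate_with_langchain_docs(
    documents,
    test_size=10,
    distributions={simple: 0.5, reasoning: 0.25, multi_context: 0.25},
)
print(testset.to_pandas().head())
```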
Changelog++ members save 5 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
Assembly AI – Turn voice data into summaries with AssemblyAI’s leading Speech AI models. Built by AI experts, their Speech AI models include accurate speech-to-text for voice data (such as calls, virtual meetings, and podcasts), speaker detection, sentiment analysis, chapter detection, PII redaction, and more.