Machine Learning Street Talk (MLST)

Patrick Lewis (Cohere) - Retrieval Augmented Generation

61 snips
Sep 16, 2024
Dr. Patrick Lewis, a leading expert and coiner of Retrieval Augmented Generation (RAG), discusses the evolution of language models and the challenges in evaluating RAG systems. He highlights the importance of data quality and human-AI collaboration, while also delving into dense vs. sparse retrieval methods. Further, Patrick shares insights on striking a balance between faithfulness and fluency in RAG applications and the complexities of user interface design for AI tools. His journey from chemistry to AI research adds a unique flair to the conversation.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Outdated Evaluation

  • Current RAG evaluation datasets lag behind model advancements.
  • Simple metrics like exact string matches are insufficient for evaluating advanced RAG models.
ADVICE

Evaluate Your Evaluators

  • Evaluate your RAG evaluation metrics against human judgment.
  • Compare automatic LLM evaluations to human evaluations to ensure accuracy.
INSIGHT

Ensemble Advantage

  • Ensembles of smaller LLM judges often outperform larger models like GPT-4 in RAG evaluation.
  • This counterintuitive finding highlights the potential of smaller, specialized models.
Get the Snipd Podcast app to discover more snips from this episode
Get the app