How AI Is Built  cover image

How AI Is Built

#5 Shahul Es and Jithin James on Building Reliable LLM Applications, Production-Ready RAG, Data-Driven Evals

May 3, 2024
Creators of Ragas, Shahul and Jithin, discuss challenges in building LLM applications, emphasizing the importance of evaluation, data quality, and continuous RAG evolution. Practical takeaways include starting with a solid testing strategy and embracing synthetic data to automate test data set creation.
29:40

Podcast summary created with Snipd AI

Quick takeaways

  • Open-source LLMs require post-training optimization for specific use cases.
  • Thorough testing and evaluation prevent unexpected behaviors in deployed RAGs.

Deep dives

Evaluation Techniques for Models

When evaluating models, it is crucial to avoid direct grading as models can behave stochastically. Instead, breaking down tasks into principles and subsets allows for quantifiable answers. By diversifying evaluation approaches, models can provide more relevant outputs.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner