MLOps.community  cover image

All About Evaluating LLM Applications // Shahul Es // #179

MLOps.community

00:00

Introduction

Shahul S discusses his work in the evaluation space and projects such as Open Assist and RAGAS, exploring how to approach evaluation in the open source LLM ecosystem including debugging and troubleshooting, examining open source models, prompts, hardware, vector databases, and benchmarking.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app