MLOps.community  cover image

LLM Evaluation with Arize AI's Aparna Dhinakaran // #210

MLOps.community

00:00

Treating LLM Evaluation as a First-Class Citizen

The speakers emphasize the importance of evaluating LLMs based on prompt deltas and discuss the potential to evaluate LLMs differently for better outcomes. They also suggest focusing on prompt engineering and exploring other ways to enhance LLM performance before considering fine-tuning.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app