Evaluating Large Language Models

The speakers discuss why evaluating large language models matters and highlight the problem of low-quality data. They explore the tension between general evaluation metrics and local data context, and offer advice for both users and trainers of language models.

Play episode from 16:57
