Practically Intelligent cover image

E4: Evaluating Large Language Models with Nathan Lambert

Practically Intelligent

00:00

Evaluating Large Language Models

The speakers discuss the importance of evaluating large language models and highlight the problem of low data quality. They explore the tension between general evaluation metrics and local data context. They provide advice for users and trainers of language models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app