Ep 44: Co-Founder of Together.AI Percy Liang on What’s Next in Research, Reaction to o1 and How AI will Change Simulation

Unsupervised Learning with Jacob Effron

Evaluating AI: Challenges and Strategies

This chapter explores the difficulties of benchmarking large language models, including train-test overlap and the need for new evaluation methodologies. It highlights the importance of structured rubrics for assessing model capabilities, particularly in specialized fields such as healthcare and finance.

Play episode from 29:13