10min chapter

Ep 44: Co-Founder of Together.AI Percy Liang on What’s Next in Research, Reaction to o1 and How AI will Change Simulation

Unsupervised Learning

CHAPTER

Evaluating AI: Challenges and Strategies

This chapter explores the complexities of benchmarking large language models, including train-test overlap and the need for new evaluation methodologies. It highlights the importance of structured rubrics for assessing model capabilities, particularly in specialized fields such as healthcare and finance.
