2min chapter

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch cover image

20VC: AI Scaling Myths: More Compute is not the Answer | The Core Bottlenecks in AI Today: Data, Algorithms and Compute | The Future of Models: Open vs Closed, Small vs Large with Arvind Narayanan, Professor of Computer Science @ Princeton

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

CHAPTER

The Minefield of Evaluating Language Models: Benchmarks vs. Real-World Application

This chapter explores the complexities of assessing large language models, emphasizing the misleading nature of standardized benchmarks. It highlights the gap between high benchmark performance and real-world applicability, questioning the viability of these assessments due to potential data contamination.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode