Practical AI cover image

Only as good as the data

Practical AI

CHAPTER

Data Strategies in Machine Learning

This chapter explores the critical aspects of creating effective test and evaluation sets in machine learning, emphasizing the significance of random sampling and benchmark data. It discusses the interplay between training data and existing benchmarks, showcasing examples such as machine translation and question answering tasks. Additionally, the chapter introduces retrieval augmented generation (RAG) and its impact on data quality, illustrating how users can enrich generative models with their own data while maintaining their core functionality.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode