Generally Intelligent cover image

Episode 36: Ari Morcos, DatologyAI: On leveraging data to democratize model training

Generally Intelligent

CHAPTER

Navigating Synthetic Data in Model Training

This chapter explores the complexities and implications of utilizing synthetic data in the training of models, emphasizing the need for careful curation to accurately reflect the desired data distribution. It discusses potential benefits, such as augmenting underrepresented areas, while warning against risks like model collapse if synthetic data is used carelessly.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner