Generally Intelligent cover image

Episode 36: Ari Morcos, DatologyAI: On leveraging data to democratize model training

Generally Intelligent

00:00

Navigating Synthetic Data in Model Training

This chapter explores the complexities and implications of utilizing synthetic data in the training of models, emphasizing the need for careful curation to accurately reflect the desired data distribution. It discusses potential benefits, such as augmenting underrepresented areas, while warning against risks like model collapse if synthetic data is used carelessly.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app