Generally Intelligent cover image

Episode 36: Ari Morcos, DatologyAI: On leveraging data to democratize model training

Generally Intelligent

00:00

Rethinking Data Definitions in Language Modeling

This chapter explores the complexities of defining 'data' in language modeling, challenging traditional document-centric perspectives. It highlights the importance of recognizing the nuances in various data contexts and discusses how quality data directly impacts model performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app