
We aren't running out of training data, we are running out of open training data
Interconnects
00:00
Introduction
Exploring the scarcity of open training data, data licensing deals, scaling language models, and the shift toward synthetic and multimodal training data.
Transcript


