
Ep. 235 - Steven Zapata
The Collective Podcast
Is There Value in That?
Lion uses what common crawl does and yeah, well, common crawl has put it out as far as I know it. So they use the links to go look at the images and they train the model with that. Lion uses that to produce these products. They have something like line five B, five billion images, line 400 mill. That's 400 million images and there's different things that go on. And again, what these data sets are, are links to images and the associated text that is within the image.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.