
Ep. 235 - Steven Zapata

The Collective Podcast

CHAPTER

Is There Value in That?

LAION uses what Common Crawl does, and, well, Common Crawl has put it out, as far as I know. So they use the links to go look at the images, and they train the model with that. LAION uses that to produce these products. They have something like LAION-5B, that's five billion images, and LAION-400M, that's 400 million images, and there's different things that go on. And again, what these datasets are, are links to images and the associated text that is within the image.
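To make the "links plus text" point concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `LinkRecord` type, the `resolve` function, and the fake in-memory "web" are hypothetical names, not LAION's or Common Crawl's actual schema or tooling. It only shows the shape of the idea: the dataset row holds a URL and some text, and the image itself has to be fetched separately by whoever uses the dataset.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, Tuple


@dataclass(frozen=True)
class LinkRecord:
    """One hypothetical dataset row: a link plus its text, no pixels."""
    url: str
    text: str


def resolve(records: Iterable[LinkRecord],
            fetch: Callable[[str], bytes]) -> Iterator[Tuple[bytes, str]]:
    """Follow each link with `fetch` and pair the image bytes with the text."""
    for record in records:
        try:
            image_bytes = fetch(record.url)
        except OSError:
            # A link can be unreachable; this sketch simply skips it.
            continue
        yield image_bytes, record.text


# Stand-in for a real HTTP download so the sketch runs offline (assumption).
FAKE_WEB = {"https://example.com/cat.jpg": b"<jpeg bytes>"}


def fake_fetch(url: str) -> bytes:
    if url not in FAKE_WEB:
        raise OSError(f"unreachable: {url}")
    return FAKE_WEB[url]


records = [
    LinkRecord("https://example.com/cat.jpg", "a cat on a sofa"),
    LinkRecord("https://example.com/gone.jpg", "a missing image"),
]
pairs = list(resolve(records, fake_fetch))
print(pairs)
```

The records themselves never contain image data; only the `fetch` step, supplied by the user of the dataset, turns a link into an image that a model could be trained on.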
