Currents 087: Shivanshu Purohit on Open-Source Generative AI

The Jim Rutt Show

The History of Deep Learning

In 2021, we released the largest available open-source text corpus. It consisted of about 300 billion tokens, and tokens are basically words in language-model lingo. So you could just use that to train your own models. And it got us quite a lot of attention from multiple companies, one of which was a cloud provider named CoreWeave. They were interested in helping us scale even more; they offered to build their own data center.
