2min chapter

The Stack Overflow Podcast cover image

MosaicML: Deep learning models for sale, all shapes and sizes

The Stack Overflow Podcast

CHAPTER

The 7B MPT Model Is a Demo Track

We train a lot of models for contract, which means many of them don't see the light of day. So how do people know that we can actually train good models? Mm hmm. The 7B, you know, I'll say is the baby of the family. It definitely has some bigger batter siblings that are available for our customers but really it's a demo. A lot of folks do want to train from scratch. They want to have complete control over the pre-training data. And they're built for heavy duty fine tuning. We chose to use alibi in such a way that basically you can use as long of a context as you can fit on the GPU.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode