The Slow Ramp Up of Open Source Modeling

The core model that we released as open source was trained in three days on 256 GPUs. That would not have been the case, you know, if you were working on a 7B or a 13B. So it's very exciting to see how accessible LLMs have become for smaller teams. The performance that we got was surprising, in a positive way.
