Replit AI Podcast cover image

03: The Next Generation of LLMs with Jonathan Frankle of MosaicML

Replit AI Podcast

00:00

The Cost of Having a Larger Context Window in LLMs

TPUs weren't that great at RNNs, so they designed kind of a different parallel architecture. We're also looking for whether rapid because we're going to be keep training models based on that framework. So I'm kind of excited what we'll do here. Yeah, I agree. Like, new hardware is unlocking new explorations of architectures. And I guess you guys are the first one to be able to do these abditional studies. But it comes at a cost. There's a cost, but honestly, the cost isn't that big.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app