
03: The Next Generation of LLMs with Jonathan Frankle of MosaicML
Replit AI Podcast
The Cost of Having a Larger Context Window in LLMs
TPUs weren't that great at RNNs, so they designed kind of a different parallel architecture. We're also looking for whether rapid because we're going to keep training models based on that framework. So I'm kind of excited about what we'll do here. Yeah, I agree. Like, new hardware is unlocking new explorations of architectures. And I guess you guys are the first ones to be able to do these additional studies. But it comes at a cost. There's a cost, but honestly, the cost isn't that big.