The Advantages of Disaggregated Memory for Language Modeling

This took like more than a year's of work to do the software and the hardware plumbing. Our architecture is entirely disaggregated. We can have a two terabyte memory system. We can swap it out for a 10 terabyte. The result of that is that we can have arbitrarily large language models without blowing up the chip. That's the key insight from this.

Play episode from 34:44

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app