AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
What Do You Mean When You Say Serverless GPUs?
Gasha: Serverless is one of the keys to the kingdom, if you could really do serverless well. We just as a team have chosen to focus mainly on inference, real-time inference. So if there's a user at the other end waiting for a response, we're the ones responsible for making that response happen quickly. Gasha: Why has it taken so long to get to serverless GPUs versus serverless CPUs? One of the biggest problems in serverless is what's called the cold boot time.