The New Stack Podcast cover image

Why the CNCF's New Executive Director is Obsessed With Inference

The New Stack Podcast

00:00

Cold Start and Right-Sized Models

They discuss cold-start latency, smaller task-specific models, and avoiding overuse of giant LLMs for simple queries.

Play episode from 08:08
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app