The New Stack Podcast cover image

Kubernetes GPU Management Just Got a Major Upgrade

The New Stack Podcast

00:00

Small models, MIG, and resource slicing

Kevin and Jesse explain MIG, dynamic GPU slicing via DRA, and trade-offs between specialized and large models.

Play episode from 29:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app