Kubernetes Bytes

Generative AI on Kubernetes

7 snips
Mar 12, 2024
Explore the integration of Generative AI models on Kubernetes with expert Janakiram MSV. Dive into the challenges faced in running LLM models on NVIDIA GPUs, lessons learned, and the evolution of managing AI models. Also, discover insights on market trends, acquisitions in cloud native solutions, and optimizing inference engines on Kubernetes for efficient model deployment.
Ask episode
Chapters
Transcript
Episode notes