Explore the integration of Generative AI models on Kubernetes with expert Janakiram MSV. Dive into the challenges of running LLMs on NVIDIA GPUs, lessons learned, and the evolution of managing AI models. Also, discover insights on market trends, acquisitions in cloud native solutions, and optimizing inference engines on Kubernetes for efficient model deployment.
Generative AI focuses on content generation, impacting diverse fields like marketing.
Kubernetes supports gen AI platforms with GPU operators and vector databases.
Hugging Face serves as a model repository; deploying its models requires decoupling the model from the inference code and using shared storage.
Deep dives
Evolution of AI and Generative AI
The discussion traces the evolution of AI from simple machine learning to neural networks, leading up to the current generation of generative AI. Generative AI focuses on generating content rather than just predicting or classifying, and it has become more accessible and impactful, influencing diverse fields like marketing and content creation.
Kubernetes as a Platform for Gen AI
Kubernetes serves as a platform for generative AI, supporting researchers, engineers, and developers in deploying and serving AI models efficiently. Key components like GPU operators, shared storage layers, and vector databases contribute to building a gen AI platform stack. Kubernetes integrates with LLMs to enable autonomous operations and infuse AI into operational tasks.
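As a rough sketch of what this looks like in practice, the snippet below uses the official Kubernetes Python client to request a single NVIDIA GPU for a pod; the nvidia.com/gpu extended resource is what the GPU operator advertises once installed. The pod name and container image are placeholders, not anything discussed in the episode.

```python
# Sketch: scheduling a GPU-backed pod with the Kubernetes Python client.
# Assumes the NVIDIA GPU operator is installed and advertising nvidia.com/gpu.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() when running inside a pod

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="llm-inference"),      # placeholder name
    spec=client.V1PodSpec(
        restart_policy="Never",
        containers=[
            client.V1Container(
                name="inference",
                image="my-registry/inference-engine:latest",  # placeholder image
                resources=client.V1ResourceRequirements(
                    limits={"nvidia.com/gpu": "1"}            # request one GPU
                ),
            )
        ],
    ),
)

client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```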
Setting Up Gen AI Environment
Setting up a gen AI environment in a home lab involves configuring powerful GPUs and installing Ubuntu, NVIDIA drivers, CUDA, Docker, and the NVIDIA container toolkit. Kubernetes is then connected to the container runtime, and the NVIDIA GPU operator is used to run TensorFlow models efficiently. GPU-sharing options within containers are limited, so resource allocation requires careful consideration.
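One way to sanity-check such a setup (an assumption on my part, not a step described in the episode) is to confirm from inside a container that the framework actually sees the GPUs:

```python
# Minimal check that TensorFlow can see the GPUs exposed by the
# NVIDIA container toolkit / GPU operator from inside a container.
import tensorflow as tf

gpus = tf.config.list_physical_devices("GPU")
if not gpus:
    raise SystemExit("No GPU visible - check drivers, CUDA, and the container toolkit")

for gpu in gpus:
    print("Found GPU:", gpu.name)
    # Avoid grabbing all GPU memory up front, which helps when
    # several containers share one physical GPU.
    tf.config.experimental.set_memory_growth(gpu, True)
```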
Hugging Face and Gen AI Models
Hugging Face serves as a repository for generative AI models and datasets, akin to Docker Hub for container images. It houses foundation models like LLMs and offers datasets for model training. Large models pulled from Hugging Face require substantial storage and integration within Kubernetes for efficient data access and model deployment.
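For illustration, pulling a model out of Hugging Face onto a shared volume might look like the snippet below; the repo id and target path are examples, not the ones used in the episode.

```python
# Sketch: download model weights from Hugging Face onto shared storage
# so every node in the cluster can read them.
from huggingface_hub import snapshot_download

local_path = snapshot_download(
    repo_id="mistralai/Mistral-7B-v0.1",         # example model, pick your own
    local_dir="/mnt/shared-models/mistral-7b",   # e.g. a PVC mounted on all nodes
)
print("Model downloaded to", local_path)
```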
Decoupling Model and Inference Code
Decoupling the model from the inference code is crucial to avoid issues when scaling out the inference engine: the inference code is stateless while the model is stateful, so the two need to be separated. Rather than replicating models on each node, a shared storage layer accessible from every node is recommended to keep models synchronized across a GPU cluster.
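A minimal sketch of what that separation can look like, assuming the weights already sit on a volume mounted at /mnt/shared-models: the inference code below holds no state of its own and simply loads whatever model version the shared mount provides.

```python
# Sketch: stateless inference code that loads the (stateful) model
# from a shared storage mount rather than baking it into the container image.
import os
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed mount point for the shared storage layer (PVC, NFS, etc.)
MODEL_DIR = os.environ.get("MODEL_DIR", "/mnt/shared-models/mistral-7b")

tokenizer = AutoTokenizer.from_pretrained(MODEL_DIR)
model = AutoModelForCausalLM.from_pretrained(MODEL_DIR, device_map="auto")

def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Run a single generation against the locally mounted model."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```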
Model Serving and Inference Engines
Model serving plays a pivotal role in exposing and interfacing with models for consumption. In the context of AI, inference engines like TGI (Text Generation Inference) offer OpenAI-compatible APIs, enabling seamless model interaction. Leveraging tools like LangChain adds flexibility by allowing easy model swapping behind consistent API endpoints, optimizing the serving process for AI applications.
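Because these engines expose OpenAI-compatible endpoints, application code can stay the same while the backend model is swapped out. A hedged sketch, assuming the inference service is reachable inside the cluster at the URL shown:

```python
# Sketch: calling a self-hosted, OpenAI-compatible inference endpoint
# (e.g. one running inside the cluster) with the standard openai client.
from openai import OpenAI

client = OpenAI(
    base_url="http://inference.default.svc.cluster.local:8080/v1",  # assumed service URL
    api_key="not-needed",  # many self-hosted engines ignore the key
)

response = client.chat.completions.create(
    model="mistral-7b",  # whatever model the engine is serving
    messages=[{"role": "user", "content": "Summarize why GPUs matter for LLM inference."}],
)
print(response.choices[0].message.content)
```

Swapping the backend, whether directly like this or through LangChain's wrappers, then comes down to changing the base URL and model name.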
In this episode of the Kubernetes Bytes podcast, Ryan and Bhavin sit down with Janakiram MSV - an advisor, analyst, and architect - to talk about how users can run Generative AI models on Kubernetes. The discussion revolves around Jani's home lab and his experimentation with different LLM models and how to get them running on NVIDIA GPUs. Jani has spent the past year becoming a subject matter expert in GenAI, and this discussion highlights the different challenges he faced and the lessons he learned from them.
Check out our website at https://kubernetesbytes.com/