

Generative AI on Kubernetes
Mar 12, 2024
Explore running generative AI models on Kubernetes with expert Janakiram MSV. The episode covers the challenges of running LLMs on NVIDIA GPUs, lessons learned, and how the management of AI models has evolved. It also touches on market trends, acquisitions in cloud native solutions, and optimizing inference engines on Kubernetes for efficient model deployment.
Chapters
Introduction
00:00 • 2min
Acquisitions in Cloud Native Solutions and Kubernetes Cost Benchmark Report
02:06 • 12min
Discussion on Kubernetes and Market Trends with a Market Research Analyst
14:27 • 2min
The Evolution of Generative AI and Kubernetes Integration
16:40 • 14min
AI Integration with Kubernetes and GPUs in Cloud-Native Environments
30:28 • 26min
Optimizing Inference Engines on Kubernetes with TGI X Gen and More
56:43 • 15min
Practical Insights on GenAI on Kubernetes and Event Discount Offer in New York City
01:11:27 • 4min