12min chapter

Kubernetes Podcast from Google cover image

Working Group Serving, with Yuan Tang and Eduardo Arango

Kubernetes Podcast from Google

CHAPTER

Empowering AI Model Serving in Kubernetes

This chapter introduces a new workgroup dedicated to enhancing AI model serving within the Kubernetes ecosystem, emerging from discussions at KubeCon Europe. The speakers discuss challenges such as startup times and the limitations of Kubernetes APIs, while emphasizing the group's mission to optimize workloads for AI inference and leveraging collaborations across the community. Additionally, it illuminates the complexities introduced by generative AI and explores potential solutions like dynamic resource allocation to improve multi-GPU and multi-node workload management.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode