2min chapter

The MLOps Podcast cover image

🤗 Large ML models in production with HuggingFace CTO Julien Chaumond

The MLOps Podcast

CHAPTER

Is There a Real Time Inference Package Support?

We have this product in in private beta, like any one should feel free to get in touch. Weare looking for, like a lot of a betatestar. A w, without diving too much into the details, it's built on top of so twere such as triton. I don't know if you've heard of triton, which is like a pretty call like onis optimi run time. And the last question we have from the community is from cha a about how do you plan to support a real time inference? Ah, do you have an inference package support to support the distilled or smaller models?

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode