4min chapter

MLOps.community  cover image

The Long Tail of ML Deployment // Tuhin Srivastava // MLOps Podcast #161

MLOps.community

CHAPTER

How to Choose the Right Horizontal Scaling Setup for Height Traffic Partners

As you scale, board the things increase, which is the compute as well as the storage. You're talking about deploying things that are serving things that add in a way that'd be hard to serve in the past. And so whether that be, right now we're trying to deploy a, I think it's like a 60 billion parameter model with floating point 32, FP32 or something like that,. The largest use case is almost like a consumer use case right now, is at like a scale that we haven't thought about.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode