#600: Amazon SageMaker Multi Model Endpoints

AWS Podcast

CHAPTER

How SageMaker MME Can Help You Scale Your Models

SageMaker supports automatic scaling for your hosted models: it dynamically adjusts the number of instances provisioned for a model in response to changes in your workload. With MME, multiple models can share the same instance. Customers can use these metrics to benchmark how many models they can host on an MME.
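
As a rough illustration of the automatic scaling described above, the sketch below builds the two requests that Application Auto Scaling expects for a SageMaker endpoint variant: one registering the variant's instance count as a scalable target, and one attaching a target-tracking policy. The endpoint name, variant name, capacity limits, target value, and cooldowns are all hypothetical placeholders, not values from the episode, and the helper function names are invented for this example.

```python
def _resource_id(endpoint_name: str, variant_name: str) -> str:
    # Resource ID format Application Auto Scaling uses for SageMaker variants.
    return f"endpoint/{endpoint_name}/variant/{variant_name}"


def scalable_target_request(endpoint_name, variant_name, min_instances, max_instances):
    """Build kwargs for register_scalable_target (hypothetical helper)."""
    return {
        "ServiceNamespace": "sagemaker",
        "ResourceId": _resource_id(endpoint_name, variant_name),
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
        "MinCapacity": min_instances,
        "MaxCapacity": max_instances,
    }


def target_tracking_policy_request(endpoint_name, variant_name, invocations_per_instance):
    """Build kwargs for put_scaling_policy (hypothetical helper)."""
    return {
        "PolicyName": f"{endpoint_name}-{variant_name}-invocations",
        "ServiceNamespace": "sagemaker",
        "ResourceId": _resource_id(endpoint_name, variant_name),
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            # Assumed target; tune it from your own benchmarking.
            "TargetValue": float(invocations_per_instance),
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
            },
            "ScaleInCooldown": 300,  # seconds, assumed
            "ScaleOutCooldown": 60,  # seconds, assumed
        },
    }


def apply_autoscaling(endpoint_name, variant_name, min_instances, max_instances, target):
    """Send both requests to AWS. Requires boto3 and valid credentials."""
    import boto3  # imported here so the builders above work without it

    client = boto3.client("application-autoscaling")
    client.register_scalable_target(
        **scalable_target_request(endpoint_name, variant_name, min_instances, max_instances)
    )
    client.put_scaling_policy(
        **target_tracking_policy_request(endpoint_name, variant_name, target)
    )
```

Calling `apply_autoscaling("my-mme-endpoint", "AllTraffic", 1, 4, 100)` would, under these assumptions, let the variant scale between one and four instances while aiming for about 100 invocations per instance per minute. Keeping the request builders separate from the AWS call makes the configuration easy to inspect before anything is sent.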
