
#600: Amazon SageMaker Multi Model Endpoints
AWS Podcast
How SageMaker MME Can Help You Scale Your Models
SageMaker supports automatic scaling for your hosted models: it dynamically adjusts the number of instances provisioned for a model in response to changes in your workload. With multi-model endpoints (MME), multiple models can share the same instance. Customers can use the available endpoint metrics to benchmark how many models they can host on an MME.
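To make the scaling idea concrete, here is a minimal Python sketch of how a target-tracking policy for an endpoint variant might be set up through Application Auto Scaling. It is an illustration under stated assumptions, not something taken from the episode: the endpoint name `my-mme-endpoint`, the variant name `AllTraffic`, the capacity limits, the target of 100 invocations per instance, and the helper names `build_autoscaling_requests` and `apply_autoscaling` are all hypothetical. The request-building step is kept separate from the AWS call so it can be inspected without credentials.

```python
def build_autoscaling_requests(
    endpoint_name,
    variant_name="AllTraffic",            # assumed variant name
    min_instances=1,                      # assumed lower bound
    max_instances=4,                      # assumed upper bound
    target_invocations_per_instance=100.0,  # assumed target, tune from benchmarks
):
    """Build keyword arguments for the two Application Auto Scaling calls."""
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    # Declares the variant's instance count as scalable within fixed bounds.
    target = {**common, "MinCapacity": min_instances, "MaxCapacity": max_instances}
    # Target tracking: add or remove instances to hold the metric near the target.
    policy = {
        **common,
        "PolicyName": f"{endpoint_name}-invocations-target",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": target_invocations_per_instance,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
            },
        },
    }
    return target, policy


def apply_autoscaling(target, policy):
    """Send the requests to AWS. Requires boto3 and valid AWS credentials."""
    import boto3  # imported here so the builder above works without boto3

    client = boto3.client("application-autoscaling")
    client.register_scalable_target(**target)
    client.put_scaling_policy(**policy)


# Hypothetical endpoint name, used only for illustration.
target, policy = build_autoscaling_requests("my-mme-endpoint")
```

Calling `apply_autoscaling(target, policy)` would register the policy against a real endpoint. Because all models on an MME share the same instances, a reasonable target value depends on the combined traffic across the hosted models, which is where benchmarking against the endpoint's metrics comes in.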