#608: Generative AI Roundup - August 2023

AWS Podcast

CHAPTER

How to Reduce Training Costs With AWS Inferentia 2

AWS Inferentia 2 is an accelerator that's designed by AWS to deliver high performance at the lowest cost for your deep learning inference applications. It gives you up to four times higher throughput and up to ten times lower latency compared to the first-generation Inferentia. And we expect this to help customers get about 40% lower training costs. Now, with any new domain, there's a lot to learn, and often it's daunting. So we've released some training that should help.
