AWS Podcast cover image

#608: Generative AI Roundup - August 2023

AWS Podcast

00:00

How to Reduce Training Costs With AWS Inferentia 2

AWS Inferentia 2 is an accelerator that's designed by AWS to deliver high performance at the lowest cost for your deep learning inference applications. It gives you four times higher throughput and 10 times lower latency compared to Inferentia. And we expect this to help customers get about 40% lower training costs. Now, with any new domain, there's a lot to learn and often it's daunting. So we've released some training that should help.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app