How to Reduce Training Costs With AWS Inferentia 2

AWS Inferentia 2 is an accelerator that's designed by AWS to deliver high performance at the lowest cost for your deep learning inference applications. It gives you four times higher throughput and 10 times lower latency compared to Inferentia. And we expect this to help customers get about 40% lower training costs. Now, with any new domain, there's a lot to learn and often it's daunting. So we've released some training that should help.

Play episode from 17:45

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app