The Cost of Deployment on GPUs and CPUs

When people get into larger deployments on ML and neural networks, the cost significantly shifts. You're for a lot of larger enterprises that are actively deploying 80, 90% of their costs is purely in deployment on these machines. So there can be a significant reduction once you're at that scale. As soon as you go into deployment, model optimization is a great thing to start because it's essentially just free performance that's left on the table That can significantly affect your bottom line.

Play episode from 04:54

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app