Large models on CPUs (Practical AI #221)

Changelog Master Feed

The Cost of Deployment on GPUs and CPUs

When people get into larger deployments of ML and neural networks, the cost shifts significantly. For a lot of larger enterprises that are actively deploying, 80 to 90% of their costs are purely in deployment on these machines. So there can be a significant reduction once you're at that scale. As soon as you go into deployment, model optimization is a great place to start, because it's essentially just free performance that's left on the table, and that can significantly affect your bottom line.

