The Cloudcast

Sizing AI Workloads

Apr 24, 2024
John Yue, CEO & Co-Founder @ inference.ai, delves into AI workload sizing, matching GPUs to workloads, and the complexities of AI/ML hosting. Topics include business considerations in GPU selection, challenges in AI hardware procurement, and the importance of tailored solutions for varying workload demands.