The Circuit

Episode 120: NVIDIA Earnings, The Future of AI inference, China AI and more

24 snips
Jun 2, 2025
Austin Lyons, an expert in AI and technology trends, joins to discuss Nvidia's recent earnings and their implications for the future of AI. They dive into the impact of U.S. regulations on sales to China and the strategic pivot towards AI inference. Lyons highlights the surge in demand for Nvidia's systems driven by giants like OpenAI and Microsoft. The conversation also covers AI's role in enhancing business decision-making, real-world applications, and the challenges of ensuring effective integration and user-friendly interfaces.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Nvidia Shifting To AI Inference

  • Nvidia is transitioning from training-focused AI workloads to a growing inference market.
  • Customers are validating demand by buying Nvidia systems specifically to run AI inference workloads.
INSIGHT

AI Token Growth vs Nvidia Market Share

  • AI token generation usage is skyrocketing, with Microsoft processing over 100 trillion tokens in Q1.
  • Despite growth, it's unclear how much of this inference market Nvidia will retain long term.
INSIGHT

Inference Costs Drive Alternatives

  • High-scale inference costs push hyperscalers to develop alternatives like Google's TPU.
  • Personal AI use suggests future AI must deeply know individuals, likely requiring on-device processing.
Get the Snipd Podcast app to discover more snips from this episode
Get the app