

What's Next for AI Infrastructure with Amin Vahdat | AI Basics with Google Cloud
May 1, 2025
In this discussion, Amin Vahdat, VP of ML at Google Cloud, shares his insights on the groundbreaking infrastructure behind AI. He explains how Google’s TPUs are revolutionizing real-time data processing and enhancing AI capabilities. Vahdat predicts 2025 as the pivotal 'Year of Inference' for startups. He highlights advancements in low-code development, the evolution of AI agents, and the transformative potential of cloud computing. Dive in to explore how technology is reshaping decision-making and creating opportunities for entrepreneurs!
AI Snips
Parallel AI Powers Deep Research
- Jason Calacanis shared a personal experience using Google Gemini for deep research on music.
- He was amazed by how the AI worked in parallel to quickly answer complex queries.
TPUs Enable Massive Real-Time Compute
- AI queries run on thousands of standard servers plus TPUs, each of which packs the compute of about 100 servers into a single chip.
- Complex queries involve many subqueries orchestrated in real time to deliver answers.
Imagine Beyond Current Limits
- Internet bandwidth and compute have exploded, enabling previously unthinkable applications.
- Founders should imagine what they want to build and trust that infrastructure will soon be able to handle it efficiently.