
This Week in Startups
What's Next for AI Infrastructure with Amin Vahdat | AI Basics with Google Cloud
May 1, 2025
In this discussion, Amin Vahdat, VP of ML at Google Cloud, shares his insights on the infrastructure behind AI. He explains how Google's TPUs are transforming real-time data processing and expanding AI capabilities. Vahdat predicts that 2025 will be the pivotal 'Year of Inference' for startups. He highlights advancements in low-code development, the evolution of AI agents, and the transformative potential of cloud computing. Dive in to explore how technology is reshaping decision-making and creating opportunities for entrepreneurs!
27:34
Podcast summary created with Snipd AI
Quick takeaways
- Google's TPU technology significantly boosts AI processing capability, enabling startups to take on projects that were previously out of reach.
- The industry's shift from model training to inference puts a premium on immediate AI responsiveness, changing how startups turn models into actionable insights.
Deep dives
The Scaling of AI Infrastructure
The infrastructure needed to support AI applications has grown dramatically, with Google's cloud relying on custom accelerators called tensor processing units (TPUs). A single TPU chip can deliver the computing power of roughly 100 standard servers, enabling massive parallel processing. This supports complex queries and deep research by running numerous subqueries simultaneously and compiling their results into a single answer. By coordinating thousands of these chips, Google has built some of the largest computing clusters in the world, paving the way for richer AI applications and experiences.
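The fan-out pattern described above can be sketched in a few lines. This is an illustrative Python sketch only, not Google's implementation: `answer_subquery` is a hypothetical stand-in for a call to an accelerator-backed model endpoint, and the parallelism here uses ordinary threads rather than TPUs.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical helper: stands in for a call to a model-serving endpoint.
# In a real system this would be a network request to an inference service.
def answer_subquery(subquery: str) -> str:
    return f"answer to: {subquery}"

def deep_research(subqueries: list[str]) -> list[str]:
    # Fan the subqueries out in parallel, then gather the partial
    # answers so they can be compiled into a final response.
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(answer_subquery, subqueries))

partial_answers = deep_research([
    "What is a TPU?",
    "How are TPU chips interconnected?",
])
print(len(partial_answers))
```

The key idea is the same at any scale: decompose one hard question into many independent subqueries, execute them concurrently, and synthesize the partial answers, which is what makes thousands of coordinated chips useful for a single request.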