
This Week in Startups

What's Next for AI Infrastructure with Amin Vahdat | AI Basics with Google Cloud

May 1, 2025
In this discussion, Amin Vahdat, VP of ML at Google Cloud, shares his insights on the infrastructure behind AI. He explains how Google's TPUs support real-time data processing and expand AI capabilities, and he predicts that 2025 will be the pivotal 'Year of Inference' for startups. He also covers advances in low-code development, the evolution of AI agents, and how cloud computing is reshaping decision-making and creating opportunities for entrepreneurs.
27:34

Podcast summary created with Snipd AI

Quick takeaways

  • Google's TPU technology significantly enhances AI processing capabilities, letting startups efficiently take on projects that were previously out of reach.
  • The shift from model training to inference highlights the need for immediate AI responsiveness, changing how startups use AI to produce actionable insights.

Deep dives

The Scaling of AI Infrastructure

The infrastructure needed to support AI applications has grown significantly, and Google's cloud relies on custom accelerators such as tensor processing units (TPUs). A single TPU chip can deliver computing power equivalent to 100 standard servers, which enables massively parallel processing. That capacity supports complex queries and deep research, where numerous subqueries run simultaneously and their results are compiled into an accurate answer. By coordinating thousands of these chips, Google has built one of the largest computing clusters, paving the way for enhanced AI applications and experiences.
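
The idea of running many subqueries at once and compiling the results can be sketched in a few lines of Python. This is only an illustration of the general fan-out and gather pattern, not a description of how Google's systems are built. The names `run_subquery`, `deep_research`, and `answer` are hypothetical, and the short sleep stands in for a real model or search call.

```python
import asyncio


async def run_subquery(subquery: str) -> str:
    # Hypothetical stand-in for a model or search call (assumption, not a real API).
    await asyncio.sleep(0.01)
    return f"finding for {subquery!r}"


async def deep_research(question: str, aspects: list[str]) -> str:
    # Fan out: derive one subquery per aspect of the question.
    subqueries = [f"{question} ({aspect})" for aspect in aspects]
    # Run all subqueries concurrently; gather returns results in input order.
    findings = await asyncio.gather(*(run_subquery(q) for q in subqueries))
    # Fan in: compile the individual findings into one answer.
    return "\n".join(findings)


def answer(question: str, aspects: list[str]) -> str:
    # Synchronous entry point for callers that are not already async.
    return asyncio.run(deep_research(question, aspects))


if __name__ == "__main__":
    print(answer("How do TPUs scale?", ["hardware", "networking", "software"]))
```

Because the subqueries wait concurrently rather than one after another, the total time is close to that of the slowest subquery instead of the sum of all of them.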
