Everything Product Podcast cover image

Everything Product Podcast

Scaling AI for Billions: Meta’s Distributed Inference & Capacity Planning Secrets

Mar 2, 2025
47:21

Join us for an in-depth conversation with Rahul Rai, an Infrastructure Product Manager at Meta with over a decade of experience spanning Cisco, Samsung, Amazon, Microsoft, and Meta. In this episode, Rahul unpacks the complexities of AI model development—from training to inference—and reveals how product managers can navigate the technical and strategic challenges of building scalable systems that serve billions of users.


What You'll Learn:

• The fundamentals of AI training vs. inference and why distributed inference is a game changer.

• How to balance the needs of multiple stakeholders—from ML engineers to end users—in building robust, scalable products.

• Real-world insights into capacity planning and the creation of internal tools that impact global-scale operations.

• A candid comparison of PM cultures and career paths at Amazon, Microsoft, and Meta, along with tips for transitioning into an infrastructure PM role.

• Recommended resources and actionable advice for any PM looking to excel in high-impact technical roles.


Timestamps:

00:00 – Podcast Intro & Setup

01:46 – Meet Rahul Rai: Background and Career Journey

03:17 – AI 101: Training vs. Inference Fundamentals

06:24 – Deep Dive: Analogies and Real-World Examples of AI Inference

10:03 – Distributed Inference Explained: How Models Stay Current

12:34 – Cost Breakdown: Why Inference Drives 90% of AI Model Costs

17:13 – The Infra PM Role: Balancing Stakeholder Needs

22:43 – Building at Scale: Capacity Planning Tools for Billion-User Platforms

24:36 – Meta’s AI Strategy: Open Source Models and Product Integration

28:28 – Comparing Cultures: PM Roles at Amazon, Microsoft, & Meta

36:50 – Autonomy at Meta: Bottom-Up Problem Solving in Action

40:30 – Growth Opportunities: Essential Skills for Infra PMs

42:46 – Measuring Success: Metrics and Impact in Infrastructure PM

44:32 – Resources for PMs: "First 90 Days" & "Getting Stuff Done"

46:05 – Wrap-Up & Connect: How to Follow Rahul Rai

Recommended Tags:

#ProductManagement #InfrastructurePM #AIInference #DistributedInference #Meta #Amazon #Microsoft #TechLeadership #CapacityPlanning #OpenSourceAI #MachineLearning #ProductManager #TechPodcast

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner