

At scale, anything that could fail definitely will
Sep 3, 2024
Pradeep, an expert in global-scale systems and cloud technology, shares insights on preparing for system failures with robust security layers. He emphasizes treating VMs as untrustworthy to enhance resilience. The conversation shifts to the future of cloud computing, especially as GenAI becomes integral to technology stacks. Pradeep also discusses strategies for building reliable cloud infrastructure and managing outages, highlighting lessons from his experiences at major tech companies.
Chapters
Transcript
Episode notes
1 2 3 4 5
Intro
00:00 • 2min
From Gaming to Cloud: A Journey through Software Engineering
01:54 • 3min
Building Resilient Cloud Infrastructure
05:18 • 16min
Navigating the Intersection of Cloud Computing and AI Innovation
21:47 • 2min
Engineering Challenges and Opportunities in Modern AI Workloads
23:52 • 6min