The InfoQ Podcast

InfoQ
undefined
4 snips
Oct 27, 2025 • 38min

Effective Error Handling: A Uniform Strategy for Heterogeneous Distributed Systems

Jenish Shah, a backend engineer at Netflix with expertise in distributed systems, shares insights on effective error handling in complex environments. He discusses the evolution of microservices and the significance of clear error messages for user experience. Jenish introduces his centralized exception library, designed for uniformity across various protocols like HTTP and gRPC. He emphasizes the importance of observability metrics in error detection and offers advice on building reusable solutions to enhance engineering consistency.
undefined
18 snips
Oct 22, 2025 • 51min

Cloud and DevOps InfoQ Trends Report 2025

Shweta Vohra, a lead architect at Booking.com and cloud-native expert, joins Stefian Weas, InfoQ lead editor, and Matt Saunders, head of DevOps at Adaptavist, to discuss pivotal trends in cloud and DevOps. They explore the rapid AI adoption in cloud services and the challenges of integrating legacy systems. Shweta emphasizes the rise of FinOps and Kubernetes' dominance, while Matt reflects on the growing executive interest in developer platforms. The panel predicts a shift towards maturing digital sovereignty and the significance of optimizing cloud costs amidst AI budgeting pressures.
undefined
21 snips
Oct 13, 2025 • 52min

Mental Models in Architecture & Societal Views of Technology: A Conversation with Nimisha Asthagiri

Nimisha Asthagiri, Global Director in Data and AI at ThoughtWorks, chats about the significance of systems thinking and mental models in architecture. She reveals how misalignments in technology can lead to societal issues, emphasizing responsible AI practices. Nimisha discusses scaling multi-agent systems and the delicate balance of human involvement in tech operations. She envisions a harmonious relationship with AI that respects human agency, and shares her passion for simplifying complex designs while advocating for better architectural outcomes.
undefined
Oct 6, 2025 • 36min

Elena Samuylova on Large Language Model (LLM) Based Application Evaluation and LLM as a Judge

Elena Samuylova, Co-founder and CEO of Evidently AI, shares her expertise in LLM-powered application evaluation. She highlights the importance of distinguishing between model and system evaluations. Elena introduces the concept of using an LLM as a judge, discussing its benefits and limitations. She emphasizes the workflow for LLM evaluation, including iterative checks and stress testing. Furthermore, she advises on designing custom LLM judges and stresses the significance of team roles in this process, encouraging developers to adapt their skills as the field evolves.
undefined
Sep 29, 2025 • 42min

The Hidden Vulnerability of The Open Source Software Supply Chain: The Underlying Infrastructure

Brian Fox, CTO and co-founder of Sonatype and a key figure in open-source projects like Maven, dives into the implications of the EU Cyber Resilience Act. He discusses the hidden risks it poses to open-source maintainers, highlighting potential legal liabilities and sustainability challenges for registries. Fox reveals how major cloud providers account for much of Maven Central's traffic and suggests innovative solutions like repository managers and cost-structures to tackle inefficiencies in software consumption. His insights are critical for navigating today’s complex software landscape.
undefined
19 snips
Sep 24, 2025 • 53min

AI, ML, and Data Engineering InfoQ Trends Report 2025

Savannah Kounowski, Managing Director at IDEO, explores how human-centered design influences technology, emphasizing the need for simpler interfaces in generative AI adoption. Daniel Dominguez, managing partner at SunX Labs, discusses the importance of on-device models for privacy and cost. They delve into the rise of multimodal AI, everyday robotics, and the role of language models in robotic planning. Insights include AI's impact on software development, the shift to no-code platforms, and the need for trust in AI products.
undefined
30 snips
Sep 15, 2025 • 46min

Scaling Systems, Companies, and Careers with Suhail Patel

In this conversation, Suhail Patel, a Principal Engineer at Monzo, discusses the rapid scaling of the fintech platform and the core infrastructure. He shares insights on the evolution of microservices and the importance of automation in supporting growth. Patel emphasizes securing buy-in from teams to showcase true platform value and the necessity of minimum viable architectures. He also dives into team dynamics, highlighting communication's role in leadership and the integration of AI in enhancing engineering practices.
undefined
29 snips
Sep 8, 2025 • 60min

Safely Changing Software to Avoid Incidents: A Conversation with Justin Sheehy

In a captivating conversation, Justin Sheehy, Chief Architect at Akamai, shares insights on making software safer and more resilient. He discusses the futility of root cause analysis and stresses the importance of a shared language for incident discussions. The need for malleable and observable software is highlighted, along with the understanding that all technology decisions are inherently business-oriented. Sheehy also addresses how AI's rise complicates engineers' abilities to handle production incidents, making resilience even more crucial.
undefined
13 snips
Sep 1, 2025 • 36min

Observability in Java with Micrometer - a Conversation with Marcin Grzejszczak

Marcin Grzejszczak, a key player in the Spring and Micrometer open source teams, shares insights on observability in distributed systems. He discusses the shift from monolithic applications to microservices, emphasizing the crucial role of monitoring. The conversation explores the evolution of observability tools, like OpenTelemetry, and how Micrometer enhances metrics collection. Marcin also addresses the business implications of observability practices, including cost considerations and the necessity of effective context propagation for seamless service communication.
undefined
12 snips
Aug 25, 2025 • 43min

Why Rust Will Help You Deliver Better Low-latency Systems and Happier Developers

Andrew Lamb, a Staff Engineer at Influx Data with extensive experience in Rust and low-level systems, discusses why Rust is ideal for low-latency development. He shares insights on Rust's memory safety and productivity benefits compared to traditional languages like C/C++. The conversation touches on challenges with cloud integration, emphasizing the importance of caching strategies for real-time data access. Lamb also highlights collaboration in database development using the FDAP stack and the role of tools like 'rustfmt' in enhancing code quality.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app