Reliability, Scalability, and Maintainability - Designing Data-Intensive Applications by Martin Kleppman

Jan 19, 2026

Dive into the intricacies of designing data-intensive applications with insights on reliability, scalability, and maintainability. Explore the Twitter scalability challenge and how chaos engineering fosters resilience in systems. Discover the implications of AI-generated code and its effects on the code review process. Learn about data models, query languages, and the trade-offs between various database strategies. Finally, get practical takeaways that highlight the timelessness of these concepts for engineers.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

INSIGHT

Design For Fault Tolerance

Faults do not equal failures; design systems to tolerate faults to avoid user-facing failures.
Carter Morgan emphasizes chaos engineering and planning for hardware, software, and human faults.

ADVICE

Invert Read-Heavy Workloads

Invert heavy read workflows: precompute or push updates for common reads rather than querying many followers on demand.
Use hybrid approaches for extreme cases like celebrity accounts to avoid massive fan-out writes.

INSIGHT

Measure Tail Latency Not Just Averages

Percentiles reveal tail latency problems that averages hide and matter for user experience.
Tail requests (P99.9) often correspond to your most valuable customers and deserve attention.

Get the Snipd Podcast app to discover more snips from this episode

Get the app