Fork Around And Find Out cover image

Fork Around And Find Out

Getting to Know Kafka with Elad Eldor

Mar 7, 2025
Elad Eldor, author of 'Kafka Troubleshooting in Production' and a Data Ops Engineer at Unity, shares his wealth of knowledge about Kafka. He discusses the differences between running Kafka on-prem and in the cloud, unveiling the complexities of cluster management and performance tuning. Elad emphasizes the importance of understanding system bottlenecks and resource optimization to avoid excessive costs. He also touches on the challenges of manual monitoring and the interplay between human expertise and technology in today’s operational landscape.
56:36

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Running Kafka on-premises allows for tailored hardware solutions but carries risks like hardware failures that can impact clusters.
  • Kafka serves as a versatile pub/sub messaging system, enabling efficient data flow integration while requiring careful scaling management to avoid bottlenecks.

Deep dives

Understanding Kafka: The Challenges of On-Premises vs. Cloud

Working with Kafka presents unique challenges that differ significantly between on-premises and cloud environments. On-premises setups provide greater control over hardware configurations, allowing for tailored solutions, but they also come with substantial risks, such as hardware failures that can cripple entire clusters. In contrast, while cloud environments offer scalability and managed solutions, they often limit fine-tuning capabilities and can obscure underlying issues due to abstracted services. This creates a steep learning curve for engineers transitioning from on-premise to cloud architectures, necessitating a solid understanding of the system's components, performance metrics, and common troubleshooting strategies.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app