Confluent Developer ft. Tim Berglund, Adi Polak & Viktor Gamov cover image

Optimizing Apache Kafka's Internals with Its Co-Creator Jun Rao

Confluent Developer ft. Tim Berglund, Adi Polak & Viktor Gamov

00:00

The Importance of Geo-Replication in Kafka

I thought you had the metadata for the cluster and every time it changed, you saved the whole thing to your topic. But actually, you're doing event sourcing on that metadata. So log is the one that keeps track over all the incremental changes. And then snapshot is sort of like an internal way for us to essentially bound the space of the log. We don't want that to keep growing for like forever. The second thing is for the process to rebuild the metadata state in a more efficient way.

Play episode from 40:47
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app