Do You Have a Coordinated CDC Pattern?
The current state of the art for CDC patterns is to try and do either the copy-on-write or merge-on-read approach. But in that case, you've got a Kafka topic that's partitioned across 15 different partitions. And who knows what points in time all those records are in? So if you randomly batch them up through like a Flink or Kafka Connect process, the downstream table actually has like a completely random table state that doesn't coordinate with basically any time in the upstream table. That really fascinates me. I'm like, how is anyone having success with these CDC tables that are basically like a random table state that is not actually a point in time? The industry needs to lead
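
To make the point-in-time problem concrete, here is a minimal Python sketch. It is a plain in-memory simulation, not Kafka, Flink, or Kafka Connect code, and every name in it (`CHANGES`, `partition_for`, `apply_batch`, the three-partition layout) is a hypothetical stand-in chosen for illustration. It assumes each change carries an upstream commit sequence number and that records are partitioned by key, so ordering holds only within a partition.

```python
from typing import Dict, List, Tuple

Change = Tuple[int, str, int]  # (upstream commit sequence number, key, value)

# Hypothetical upstream change log, in commit order.
CHANGES: List[Change] = [
    (1, "a", 10),
    (2, "b", 20),
    (3, "a", 11),
    (4, "c", 30),
    (5, "b", 21),
    (6, "c", 31),
]

NUM_PARTITIONS = 3  # kept small for readability


def partition_for(key: str) -> int:
    """Deterministic stand-in for a key-hash partitioner."""
    return (ord(key) - ord("a")) % NUM_PARTITIONS


def upstream_snapshots(changes: List[Change]) -> List[Dict[str, int]]:
    """Every table state that actually existed upstream, one per commit."""
    state: Dict[str, int] = {}
    snapshots = [dict(state)]
    for _, key, value in changes:
        state[key] = value
        snapshots.append(dict(state))
    return snapshots


def split_by_partition(changes: List[Change]) -> Dict[int, List[Change]]:
    """Spread changes across partitions; order survives only per partition."""
    partitions: Dict[int, List[Change]] = {p: [] for p in range(NUM_PARTITIONS)}
    for change in changes:
        partitions[partition_for(change[1])].append(change)
    return partitions


def apply_batch(
    partitions: Dict[int, List[Change]], consumed: Dict[int, int]
) -> Dict[str, int]:
    """Build the downstream table from the first consumed[p] records of each partition."""
    state: Dict[str, int] = {}
    for p, records in partitions.items():
        for _, key, value in records[: consumed[p]]:
            state[key] = value
    return state


partitions = split_by_partition(CHANGES)
snapshots = upstream_snapshots(CHANGES)

# The batch happens to be caught up on partitions 0 and 2 but has read
# nothing from partition 1, so key "b" is missing entirely.
uneven = apply_batch(partitions, {0: 2, 1: 0, 2: 2})

# The batch stops every partition at the same upstream commit (number 4).
aligned = apply_batch(partitions, {0: 2, 1: 1, 2: 1})

print(uneven, uneven in snapshots)
print(aligned, aligned in snapshots)
```

In this sketch the uneven batch produces a table that matches none of the upstream snapshots, while the batch cut at a single commit number reproduces the upstream table as of that commit. How a real pipeline would coordinate that cut across partitions is not covered here.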