Do You Have a Coordinated CDC Pattern?

The current state of the art for CDC patterns is try and do either the copy on right or merge on read approach. But in that case, you've got a Kafka topic that's partitioned across 15 different partitions. And who knows what points and time all those records are in? So if you randomly batch them up through like a flink or Kafka connect process, the downstream table actually has Like a completely random table state that doesn't coordinate with basically any time in the upstream table. That really fascinates me. I'm like, how is anyone having success with these CDC tables that are basically like arandom table state that is not actually a point in time? The industry needs to lead

Play episode from 39:59

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse

Data Engineering Podcast

Do You Have a Coordinated CDC Pattern?

Summary

Announcements

Interview

Contact Info

Parting Question

Closing Announcements

Links

The AI-powered Podcast Player