Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

Reprocessing in Batch Processes

In batch processes, you have a definitive start and end: I only loaded data up to this point. In the streaming context, you always have new data coming in. So when you want to kick off a reprocess of everything, I need to start at whatever the beginning of that happens to be, but I'm continually playing catch-up with the new events that are coming through.

Luka: What have been some useful approaches to be able to appropriately scale out, or parallelize, or apply windowing?
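The catch-up problem described in this snippet can be illustrated with a toy simulation. This is only a sketch under stated assumptions: the function names, the tick-based model, and the rates are all hypothetical and are not taken from the episode or from any particular streaming system. It contrasts a batch reprocess over a fixed slice with a streaming replay that starts at offset 0 while new events keep arriving.

```python
def batch_reprocess(log, end_offset):
    """Batch: the slice to reprocess is fixed, so the end is known up front."""
    return [event * 2 for event in log[:end_offset]]


def streaming_reprocess(log, arrivals_per_tick, process_per_tick, max_ticks=1000):
    """Streaming: replay from offset 0 while new events keep arriving.

    Returns the number of ticks until the replay reaches the head of the
    log, or None if it has not caught up within max_ticks.
    (Hypothetical model: fixed arrival and processing rates per tick.)
    """
    log = list(log)          # copy so the caller's list is not mutated
    offset = 0               # position of the reprocessing consumer
    next_event = len(log)    # value of the next event to be appended
    for tick in range(1, max_ticks + 1):
        # New events land on the log during this tick.
        log.extend(range(next_event, next_event + arrivals_per_tick))
        next_event += arrivals_per_tick
        # The consumer advances, but never past the head of the log.
        offset = min(offset + process_per_tick, len(log))
        if offset == len(log):
            return tick
    return None


if __name__ == "__main__":
    history = list(range(100))
    print(batch_reprocess(history, 3))
    print(streaming_reprocess(history, arrivals_per_tick=5, process_per_tick=25))
    print(streaming_reprocess(history, arrivals_per_tick=5, process_per_tick=5))
```

In this model the replay only catches up when it processes events faster than they arrive, which is one way to see why scaling out or parallelizing the reprocess matters.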
