Data Engineering Podcast cover image

Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

00:00

Reprocessing in Batch Processes

In batch processes, you have a definitive start and end step of i only loaded up to this point of data. In the streaming context, you always have new data coming in. So when you want to kick off a re process of everything, i need to start at whatever the beginning of that happens to be. But i'm continually playing catch up with these new events that are coming through. Luka: What have been some useful approaches to be able to appropriately scale out, or parallelyze, or apply windowing?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app