Data Engineering Podcast cover image

Building Auditable Spark Pipelines At Capital One

Data Engineering Podcast

00:00

The Filtering Approach Is Not Enriching Pattern

The data set shrinks in rows when we started with a number of rows aten be finally ending up with two rows as eligible transaction for that was actually our initial implementation. In retrospect, if we have to back trace any one transaction, right for those why it got filterd or which stage it got filtered. That grandrarity is what was lacking in the filtering approach that we immediately found out as a shortcoming. And then te immediately switch over to a pattern, which is case mostlyca witergase enriching pattern. But how we are reaching to that state is what makes it difference, based on their use case,.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner
Get the app