Building Auditable Spark Pipelines At Capital One

Data Engineering Podcast

Is There Priority Scheduling Between the Different Spark Clusters?

In terms of your specific workflow, you mentioned that you're dealing with processing some of the purchase information to calculate what rewards are going to be available to a customer, as opposed to a fraud situation. I'm sure that there are different requirements in terms of the latency that's acceptable for end-to-end processing. And I'm wondering if there are prioritizations, or different Spark clusters that are available, to make sure that the fraud analysis doesn't get held up behind the rewards computation. Or is there any sort of priority scheduling across the different Spark infrastructure that's available, and how does that manifest in terms of what the processing capacity is?
