Building Auditable Spark Pipelines At Capital One

Data Engineering Podcast

Using Apache Spark in a Data Processing Environment?

You mentioned Spark, and I know that you wrote a post a little while ago about some of the patterns that you started with. I'm wondering if you can talk through some of the system architecture and system design, and some of the constraints that you're dealing with, as far as how to think about designing the jobs that you're working with.

So our data pipeline primarily, as I was previously mentioning, uses Apache Spark as its processing framework. And we do have a mix of server and serverless options in our pipeline, depending on the use cases.
