Building Auditable Spark Pipelines At Capital One

Data Engineering Podcast

Transactional Processing

I'm wondering if you have looked at leaning on anything such as the Delta Lake format for being able to materialize this information as a set of tables. Or, if the auditability is only there for regulatory purposes, you don't necessarily need to keep it live and queryable in an easy-to-access format; you just need to be able to maintain that history and perhaps be able to compact it over time. For this particular case, we haven't really persisted at each stage. What we have been doing is enriching, so the data set actually grows.
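
As a rough illustration of the enrichment approach described in the answer above, here is a minimal sketch in plain Python rather than Spark. Everything in it is an assumption made for illustration: the stage functions `add_high_value_flag` and `add_region`, the column names, and the sample records are hypothetical and do not come from the episode. It shows only the general idea that each stage adds columns to a record instead of writing a separate snapshot per stage, so the record widens as it moves through the pipeline.

```python
from typing import Any, Callable, Dict, List

Record = Dict[str, Any]
Stage = Callable[[Record], Record]


def add_high_value_flag(record: Record) -> Record:
    # Hypothetical stage: derive one new column and keep every existing one.
    return {**record, "high_value": record["amount"] > 1000}


def add_region(record: Record) -> Record:
    # Hypothetical stage: derive a second column from an existing field.
    region = "EU" if record["country"] in {"DE", "FR"} else "OTHER"
    return {**record, "region": region}


def enrich(records: List[Record], stages: List[Stage]) -> List[Record]:
    # Apply each stage in order. Nothing is persisted between stages;
    # the records simply gain columns as they pass through.
    for stage in stages:
        records = [stage(r) for r in records]
    return records


rows = [
    {"id": 1, "amount": 2500, "country": "DE"},
    {"id": 2, "amount": 40, "country": "US"},
]
enriched = enrich(rows, [add_high_value_flag, add_region])
```

Because each stage returns a new dictionary that carries the original fields forward, the final record still contains the inputs that produced each derived value, which is one simple way to keep a trail without storing intermediate outputs.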
