The InfoQ Podcast cover image

How to Use Apache Spark to Craft a Multi-Year Data Regression Testing and Simulations Framework

The InfoQ Podcast

00:00

Data sources and Spark efficiency

Vivek recommends datasets Spark can read in parallel; JDBC-style reads are less efficient for large replays.

Play episode from 24:42
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app