Is Apache Spark Too Costly? An Amazon Engineer Tells His Story

The New Stack Podcast

Migrating from Apache Spark: Challenges of Exabyte-Scale Data Management

This chapter explores the challenges of migrating exabyte-scale data processing from Apache Spark to Ray, set against the historical context of deprecating an Oracle data warehouse. It uses customer order tracking as an example to underscore why accurate, accessible data matters for business operations at Amazon.
