Apache Spark Integration and Platform Execution for ML - ML 073

Adventures in Machine Learning

Spark and Hadoop - What's the Difference?

Spark is an evolutionary step on top of the MapReduce paradigm that was introduced by Yahoo and a bunch of universities when they created the Hadoop system. Instead of being a disk-based system, Spark effectively issues mapping commands over files located in a file store and keeps the working data in memory. Because everything is in memory, this opened up amazing possibilities for large-scale computing tasks. The optimization comes in when you're actually working on real-world problems rather than a demonstration or a tutorial.
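
To make the map-then-reduce idea a bit more concrete, here is a minimal sketch in plain Python. It is not Spark's API and not anything shown in the episode. The input lines, `map_phase`, `reduce_phase`, and `cached_pairs` are all hypothetical names chosen for illustration. It only shows the general pattern: map over lines of input, keep the mapped result in memory, and run more than one computation over it without going back to disk.

```python
from collections import Counter
from functools import reduce

# Hypothetical input: each string stands in for one line of a file in a file store.
lines = [
    "spark keeps data in memory",
    "hadoop mapreduce writes to disk",
    "spark builds on mapreduce",
]


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in a line."""
    return [(word, 1) for word in line.split()]


def reduce_phase(counts, pair):
    """Reduce step: fold one (word, 1) pair into the running totals."""
    word, n = pair
    counts[word] += n
    return counts


# Materialize the mapped pairs once and keep them in memory, so later
# computations reuse them instead of re-reading the input (assumed here
# to be the expensive step in a disk-based system).
cached_pairs = [pair for line in lines for pair in map_phase(line)]

# First job: word counts via a reduce over the cached pairs.
word_counts = reduce(reduce_phase, cached_pairs, Counter())

# Second job: total number of words, reusing the same in-memory data.
total_words = sum(n for _, n in cached_pairs)

print(word_counts["spark"], word_counts["mapreduce"], total_words)
```

In a real cluster the mapped data would be partitioned across many machines. This sketch runs in a single process and is only meant to show why reusing in-memory intermediate results helps when several computations touch the same data.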
