#61635
Mentioned in 1 episodes

Learning Spark

A beginner's guide to real-time Big Data processing using the Apache Spark framework
Book • 2020
Updated to include Spark 3.

0, this second edition teaches data engineers and data scientists how to leverage structure and unification in Spark for performing simple and complex data analytics and employing machine learning algorithms.

The book provides step-by-step walk-throughs, code snippets, and notebooks covering batch and streaming data analytics with Structured Streaming, building reliable data pipelines with Delta Lake, and developing machine learning pipelines with MLlib and MLflow.

Mentioned by

Mentioned in 1 episodes

Mentioned by
undefined
Demetrios Brinkmann
when introducing
undefined
Jules Damji
, one of the co-authors.
70 snips
Conversation with the MLflow Maintainers

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app