

#61635
Mentioned in 1 episode
Learning Spark
A beginner's guide to real-time Big Data processing using the Apache Spark framework
Book • 2020
Updated to include Spark 3.0, this second edition teaches data engineers and data scientists how to leverage structure and unification in Spark for performing simple and complex data analytics and employing machine learning algorithms.
The book provides step-by-step walk-throughs, code snippets, and notebooks covering batch and streaming data analytics with Structured Streaming, building reliable data pipelines with Delta Lake, and developing machine learning pipelines with MLlib and MLflow.
Mentioned by Demetrios Brinkmann when introducing Jules Damji, one of the co-authors.

70 snips
Conversation with the MLflow Maintainers



