

Apache Spark Integration and Platform Execution for ML - ML 073
May 26, 2022
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11
Introduction
00:00 • 2min
Spark and Hadoop - What's the Difference?
02:00 • 5min
How to Optimize a Markov Chain Problem?
07:26 • 3min
Datarix Spark
10:14 • 4min
Lazy Evaluation Is Really Powerful
14:08 • 5min
Spark Clustering - Partitioning by Columns?
19:00 • 2min
Why Are Rows Split Up Instead of Columns?
20:51 • 2min
Spark SQL
23:18 • 3min
Spark
26:24 • 3min
Spark ML vs Spark RDD?
28:59 • 4min
How to Concatenate Lists in Python?
32:34 • 4min