
Apache Spark Integration and Platform Execution for ML - ML 073
Adventures in Machine Learning
In Scala, those collections are all immutable under the covers, so it's not like you're actually mutating that RDD when you modify it. The system tries to minimize the number of shuffles that need to happen. There are still a couple more things to cover on architecture and data transformations. It's still limited by Newtonian physics, though. He has seen some truly astronomically large clusters started during his time at Databricks.