
EP10 - Optimizing Data Files in Apache Iceberg: Performance Strategies
Gnarly Data Waves by Dremio
How to Cluster Data Using Z-Order
The good news is that you can do all of this with Z-order clustering, okay? I won't go in depth in this particular session on the math behind Z-order or how the Z-value is calculated. But by definition, it's a type of space-filling curve that tries to keep similar data points together when they are mapped from a higher dimension to a lower dimension. For example, 2D to 1D or 3D to 2D, right? What happens is that when you apply the Z-order algorithm, it calculates a value called the Z-value for each record, and then it organizes the data so that records with similar Z-values end up close together.
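
To make the idea of a Z-value a bit more concrete, here is a minimal sketch in Python. It assumes two non-negative integer columns and builds the Z-value by interleaving their bits, which is the textbook way to describe a Z-order (Morton) curve. The function name `z_value` and the `bits` parameter are made up for this illustration; this is not how Iceberg or any particular engine implements it internally.

```python
def z_value(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into one integer.

    Hypothetical helper for illustration only: bits of x land in the
    even positions, bits of y land in the odd positions.
    """
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)       # bit i of x -> position 2i
        z |= ((y >> i) & 1) << (2 * i + 1)   # bit i of y -> position 2i + 1
    return z


# A small 4 x 4 grid of (x, y) points, sorted by their Z-value.
points = [(x, y) for y in range(4) for x in range(4)]
ordered = sorted(points, key=lambda p: z_value(*p))

for p in ordered:
    print(p, z_value(*p))
```

Sorting by the interleaved value walks the grid in small 2 x 2 blocks, so points that are close in both `x` and `y` tend to land next to each other in the one-dimensional order. That is the property the clustering relies on.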


