EP10 - Optimizing Data Files in Apache Iceberg: Performance Strategies

Gnarly Data Waves by Dremio

How to Cluster Data Using Z-Order

The good news is that you can do all of this with Z-order clustering, okay? I won't go in depth into the math of Z-order in this session, or how the Z-order value is calculated. But by definition, it's a type of space-filling curve that tries to keep similar data points together when they are mapped from a higher dimension to a lower dimension. For example, 2D to 1D or 3D to 2D, right? What happens is that when you apply the Z-order algorithm, it calculates a value called the Z-value, and then it organizes the data based on the similarity of those Z-values.
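
The speaker skips the math, so the following is only a rough sketch of one common way a Z-value can be computed: interleaving the bits of two integer coordinates (often called a Morton code). The function name `z_value`, the `bits` parameter, and the sample points are assumptions made for illustration; this is not presented as the implementation Apache Iceberg or Dremio uses.

```python
def z_value(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into a single integer.

    Hypothetical helper for illustration: assumes non-negative integer
    coordinates that fit in `bits` bits.
    """
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)      # bit i of x goes to even position 2i
        z |= ((y >> i) & 1) << (2 * i + 1)  # bit i of y goes to odd position 2i + 1
    return z


# Sort a small 2D grid by Z-value, mapping 2D points onto a 1D order.
points = [(x, y) for x in range(4) for y in range(4)]
ordered = sorted(points, key=lambda p: z_value(*p))
```

In the sorted list, the four points of each 2x2 block of the grid end up next to each other, which is the sense in which nearby points in 2D tend to stay close in the 1D ordering.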

Play episode from 29:30