
EP10 - Optimizing Data Files in Apache Iceberg Performance Strategies
Gnarly Data Waves by Dremio
00:00
Z-Order Clustering for Faster Querying
Using Z-order clustering, you can now be efficient with filter on multiple column. Apache I-Zbip provides the ability to organize the layer of the data using Z-order technique out of the box. You don't have to implement this on your own. There is an API already there. And even if you're using something like Spark, there is even a stored procedure tool that literally just call the procedure and apply Z-order.
Play episode from 31:14
Transcript


