Gnarly Data Waves by Dremio cover image

EP10 - Optimizing Data Files in Apache Iceberg Performance Strategies

Gnarly Data Waves by Dremio

00:00

Z-Order Clustering for Faster Querying

Using Z-order clustering, you can now be efficient with filter on multiple column. Apache I-Zbip provides the ability to organize the layer of the data using Z-order technique out of the box. You don't have to implement this on your own. There is an API already there. And even if you're using something like Spark, there is even a stored procedure tool that literally just call the procedure and apply Z-order.

Play episode from 31:14
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app