EP10 - Optimizing Data Files in Apache Iceberg: Performance Strategies

Gnarly Data Waves by Dremio

How to Cluster Data by Multiple Fields in a More Integrated Way

In an ideal world, when you group these data files by sorting, you want all the employees with similar data points to stay together in the same file. This is where things like Z-order clustering come into the picture. We're going to quickly discuss Z-order clustering before we end the presentation. But before that, just to recap: we have discussed two problems so far with metrics filtering.
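To make the idea of clustering by more than one field concrete, here is a minimal Python sketch of Z-ordering. It interleaves the bits of two integer fields into a single sort key, so rows that are close on both fields tend to end up next to each other and can be written to the same file. The function name `interleave_bits`, the sample employee tuples, and the two field names are assumptions made up for illustration; this is not how Iceberg or Dremio implements it internally.

```python
def interleave_bits(x: int, y: int, bits: int = 16) -> int:
    """Interleave the low `bits` bits of x and y into one Z-order (Morton) value."""
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)       # bits of x go to even positions
        z |= ((y >> i) & 1) << (2 * i + 1)   # bits of y go to odd positions
    return z


# Hypothetical employee rows: (age, salary_band). Values are illustrative only.
employees = [(25, 3), (52, 9), (26, 3), (51, 8), (25, 9)]

# Sorting by the interleaved key clusters rows on both fields at once,
# instead of sorting fully by age and only then by salary_band.
clustered = sorted(employees, key=lambda e: interleave_bits(e[0], e[1]))
```

A plain sort on `(age, salary_band)` only keeps the first field tightly grouped. The interleaved key gives both fields a share of the ordering, which is what lets per-file min/max ranges stay narrow on each of them.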

Play episode from 27:45