
EP10 - Optimizing Data Files in Apache Iceberg: Performance Strategies
Gnarly Data Waves by Dremio
How to Cluster Data by Multiple Fields in a More Integrated Way
In an ideal world, when you group these data files by sorting, you want all the employees with similar data points to stay together in the same file. This is where things like Z-order clustering come into the picture. We're going to quickly discuss Z-order clustering before we end the presentation. But before that, just to recap, we have discussed two problems so far with metrics filtering.
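To make the idea of Z-order clustering concrete, here is a minimal Python sketch of the usual bit-interleaving trick behind it. It is an illustration only, not how Iceberg or Dremio implement it internally. The field names `age` and `salary_band`, the sample rows, and the helper names are all assumptions made up for this example, and the values are assumed to be small non-negative integers.

```python
def interleave_bits(x: int, y: int, bits: int = 16) -> int:
    """Build a Z-order (Morton) value by interleaving the bits of x and y.

    Bit i of x goes to position 2*i, bit i of y goes to position 2*i + 1,
    so both fields contribute equally to the final sort key.
    """
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)
        z |= ((y >> i) & 1) << (2 * i + 1)
    return z


def zorder_sort(rows):
    """Sort rows by the interleaved key of two hypothetical fields."""
    return sorted(rows, key=lambda r: interleave_bits(r["age"], r["salary_band"]))


# Hypothetical employee rows, not taken from the talk.
employees = [
    {"name": "a", "age": 2, "salary_band": 0},
    {"name": "b", "age": 0, "salary_band": 2},
    {"name": "c", "age": 1, "salary_band": 1},
    {"name": "d", "age": 0, "salary_band": 0},
]

clustered = zorder_sort(employees)
```

Because neither field dominates the key, rows that are close on both fields tend to end up next to each other after sorting, and so in the same data file once the sorted rows are written out. In Iceberg itself this is typically requested through the Spark `rewrite_data_files` procedure with the `sort` strategy and a `sort_order` such as `zorder(c1,c2)`, rather than computed by hand.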
Play episode from 27:45
Transcript


