
EP10 - Optimizing Data Files in Apache Iceberg Performance Strategies
Gnarly Data Waves by Dremio
00:00
How to Optimize Your Data Leaks With Iceberg
Small data files can cause unnecessary amount of metadata to the file. And that's a problem, right? It can lead to like performance issue as well. To tackle all of these issues, we need to have a way to compact the small files. iceberg provides mechanism out of the box to do so. metrics based filtering is another way to read less file and skip them.
Play episode from 18:02
Transcript


