Gnarly Data Waves by Dremio cover image

EP10 - Optimizing Data Files in Apache Iceberg Performance Strategies

Gnarly Data Waves by Dremio

00:00

How to Optimize Your Data Leaks With Iceberg

Small data files can cause unnecessary amount of metadata to the file. And that's a problem, right? It can lead to like performance issue as well. To tackle all of these issues, we need to have a way to compact the small files. iceberg provides mechanism out of the box to do so. metrics based filtering is another way to read less file and skip them.

Play episode from 18:02
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app