
#9 Jorrit Sandbrink on Modern Data Infrastructure for Analytics and AI, Lakehouses, Open Source Data Stack

How AI Is Built

NOTE

Optimizing Data Lake Storage

Moving from file formats like Parquet to table formats like Delta means you no longer have to manage data at the level of individual files, which makes it easier to extract tables. However, the layout of files and tables still has to be optimized for performance. Platforms like Databricks offer automated optimization, but a deeper understanding helps a data engineer in cases where intervention is required, such as tuning file size for query performance or choosing a partitioning scheme.
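To make the file-size point concrete, here is a minimal sketch of how a compaction plan could be computed: small files within each partition are grouped into bins of roughly a target size, and each bin would then be rewritten as one larger file. This is an illustration only, not how Delta or Databricks implement their optimization. The `DataFile` type, the `plan_compaction` function, and the 128 MB target are all assumptions made for the example.

```python
from dataclasses import dataclass

# Assumed target file size; the right value depends on the workload.
TARGET_BYTES = 128 * 1024 * 1024


@dataclass(frozen=True)
class DataFile:
    path: str        # hypothetical file path
    partition: str   # partition value, e.g. "date=2024-01-01"
    size_bytes: int


def plan_compaction(files, target_bytes=TARGET_BYTES):
    """Group small files per partition into bins of roughly target_bytes.

    Returns a list of bins (lists of DataFile). Each returned bin holds at
    least two files that would be rewritten as a single larger file.
    """
    by_partition = {}
    for f in files:
        by_partition.setdefault(f.partition, []).append(f)

    bins = []
    for partition in sorted(by_partition):
        # Files already at or above the target are left alone.
        small = sorted(
            (f for f in by_partition[partition] if f.size_bytes < target_bytes),
            key=lambda f: f.size_bytes,
            reverse=True,
        )
        current, current_size = [], 0
        for f in small:
            # Close the current bin when adding this file would overshoot.
            if current and current_size + f.size_bytes > target_bytes:
                bins.append(current)
                current, current_size = [], 0
            current.append(f)
            current_size += f.size_bytes
        if current:
            bins.append(current)

    # A bin with a single file gains nothing from a rewrite.
    return [b for b in bins if len(b) > 1]


if __name__ == "__main__":
    MB = 1024 * 1024
    files = [
        DataFile("part-0.parquet", "date=2024-01-01", 60 * MB),
        DataFile("part-1.parquet", "date=2024-01-01", 60 * MB),
        DataFile("part-2.parquet", "date=2024-01-02", 10 * MB),
    ]
    for group in plan_compaction(files):
        print([f.path for f in group])
```

Files are never merged across partitions, which is why the partitioning scheme and the file size have to be considered together: a very fine-grained partitioning can leave each partition with only a few small files that cannot be compacted further. On Delta tables this kind of rewrite is normally done by the table format's own `OPTIMIZE` command rather than by hand.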

