DataNation - Podcast for Data Engineers, Analysts and Scientists cover image

52 – Apache Iceberg, Dremio and PuppyGraph

DataNation - Podcast for Data Engineers, Analysts and Scientists

00:00

Efficiency and Flexibility of Apache Iceberg for Data Lakes

Explore how Apache Iceberg as a central data format in a data lake house can overcome data duplication challenges, facilitate vector databases, and enable diverse data modeling possibilities like graphs, vectors, and relational data. Future developments like Havasu for geospatial queries and a document database infrastructure based on Apache Iceberg are hinted at.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app