The Joe Reis Show

DuckLake w/ Hannes Mühleisen - Practical Data Lunch and Learn. June 4, 2025

This episode introduces DuckLake, an extension of DuckDB for data management. It covers the architecture, which is aimed at improving database performance and interoperability, and includes a live demo of features such as data manipulation and time travel. Hannes Mühleisen also answers questions along the way.
ANECDOTE

DuckLake's Origin Story

  • Hannes developed the idea of DuckLake during a conversation about open table formats in Paris.
  • He realized existing formats had issues and started DuckLake to improve on them.
INSIGHT

Challenges with Open Table Formats

  • DuckDB initially ignored open table formats, but demand forced it to explore them.
  • The realization that Avro's complexity and tech stack issues slowed adoption inspired DuckLake's design.
INSIGHT

Key Innovation in DuckLake

  • DuckLake improves on open table formats by using a full SQL database for metadata management.
  • This approach fixes commit speed and small file problems and reduces system complexity.
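
To make the idea of SQL-backed metadata concrete, here is a minimal conceptual sketch in Python using SQLite. It is not DuckLake's actual schema or API: the table names (`snapshot`, `data_file`) and the helper functions (`open_catalog`, `commit`, `files_at`) are hypothetical and chosen only for illustration. It shows how a commit can be a single database transaction that records a new snapshot and the files it adds or retires, and how time travel then reduces to an ordinary query against a snapshot id.

```python
import sqlite3


def open_catalog():
    """Create an in-memory metadata catalog (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE snapshot (snapshot_id INTEGER PRIMARY KEY);
        CREATE TABLE data_file (
            path           TEXT    NOT NULL,
            begin_snapshot INTEGER NOT NULL,  -- first snapshot that sees the file
            end_snapshot   INTEGER            -- snapshot that removed it, or NULL
        );
        """
    )
    return conn


def commit(conn, add=(), remove=()):
    """Record added and removed files as one atomic transaction."""
    with conn:  # commits on success, rolls back on an exception
        snap = conn.execute("INSERT INTO snapshot DEFAULT VALUES").lastrowid
        conn.executemany(
            "INSERT INTO data_file (path, begin_snapshot) VALUES (?, ?)",
            [(path, snap) for path in add],
        )
        conn.executemany(
            "UPDATE data_file SET end_snapshot = ? "
            "WHERE path = ? AND end_snapshot IS NULL",
            [(snap, path) for path in remove],
        )
    return snap


def files_at(conn, snap):
    """Time travel: list the files visible as of a given snapshot."""
    rows = conn.execute(
        "SELECT path FROM data_file "
        "WHERE begin_snapshot <= ? "
        "AND (end_snapshot IS NULL OR end_snapshot > ?) "
        "ORDER BY path",
        (snap, snap),
    )
    return [row[0] for row in rows]


catalog = open_catalog()
first = commit(catalog, add=["a.parquet", "b.parquet"])
second = commit(catalog, add=["c.parquet"], remove=["a.parquet"])
print(files_at(catalog, first))   # files as of the first commit
print(files_at(catalog, second))  # files as of the second commit
```

In this sketch each commit touches only a few rows inside one transaction, rather than writing new metadata files, which is the intuition behind the commit-speed claim above. A real catalog would track far more (schemas, statistics, deletes), so treat this only as an illustration of the general approach.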