

DuckLake w/ Hannes Mühleisen - Practical Data Lunch and Learn. June 4, 2025
16 snips Jun 4, 2025
Discover the innovative Duck Lake extension of DuckDB, designed to revolutionize data management. The episode highlights impressive architecture aimed at enhancing database performance and interoperability. Enjoy a live demo showcasing unique features like effortless data manipulation and intriguing time travel functionalities. Hannes Mühleisen answers questions, making complex topics accessible and engaging!
AI Snips
Chapters
Transcript
Episode notes
DuckLake's Origin Story
- Hannes developed the idea of DuckLake during a conversation about OpenTable formats in Paris.
- He realized existing formats had issues and started DuckLake to improve on them.
Challenges with OpenTable Formats
- OpenTable formats initially ignored by DuckDB, but demand forced exploration.
- Realized Avro complexity and tech stack issues slowed adoption and inspired DuckLake's design.
Key Innovation in DuckLake
- DuckLake improves OpenTable formats by using a full SQL database for metadata management.
- This approach fixes commit speed and small file problems and reduces system complexity.