A Graph Database That You Can Embed - KuzuDB

19 snips

Mar 27, 2024

In a compelling discussion, Semih Salihoglu, an Associate Professor at the University of Waterloo and CEO of KuzuDB, dives into the world of graph databases. He unveils the journey of KuzuDB from academic roots to an innovative startup. The conversation reveals when to choose a graph database, KuzuDB's unique features compared to traditional systems, and advanced query optimization techniques. Salihoglu also shares insights on handling data ingestion and write operations, highlighting KuzuDB's efficiency and future aspirations in the data landscape.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

From Graph Analytics to Graph Databases

Semih Salihoglu, inspired by Mike Stonebreaker's Turing Award lecture, shifted from large-scale graph analytics to graph databases.
His frustration with the lack of practical application of his PhD work led him to explore the then-nascent field of graph databases.

INSIGHT

KuzuDB's Inspiration and Focus

KuzuDB is inspired by DuckDB's embeddable nature and SQLite.
It focuses on graph-specific data, offering scalability and speed for highly connected data.

INSIGHT

Graph Databases and Scalability

Graph databases, historically used for analytics, excel in query-oriented applications.
Scalability in this context refers to managing and querying large datasets efficiently on a single node.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

In this video I talk to Semih Salihoglu about KuzuDB : A highly scalable, extremely fast, easy to use embeddable Graph Database. Chapters: 00:00 Introduction 00:40 The Genesis of KuzuDB: From Academic Research to Startup 06:40 Graph Databases 101: Understanding the Basics and Beyond 10:24 When to Opt for a Graph Database: Use Cases and Advantages 19:16 KuzuDB vs. Traditional Databases: A Comparative Analysis 24:39 Inside KuzuDB: Optimizations and Data Ingestion Explained 31:08 Exploring Query Optimizations in Graph Databases 31:34 The Relational Nature of Graph Databases 33:33 Factorization: A Key Optimization Technique 38:50 Integrating New Data Sources and Handling Joins 43:39 Optimizing Write Operations and Index Management 50:23 Comparing Kuzu with Other Graph Databases 58:50 Future Developments and Vision for Kuzu Important links: - History of DBMSs and the IDS, which is the first database in history, which had a graph-based model: https://dl.acm.org/doi/abs/10.1145/1147376.1147382 is a good paper by CS historian on this history and a must read for everyone interested in the birth of databases as a field. - https://blog.kuzudb.com/post/what-every-gdbms-should-do-and-vision/ blog on the what every GDBMS should do and vision of Kùzu. - The user survey paper that got Semih into GDBMSs. https://arxiv.org/pdf/1709.03188.pdf - Blog on factorization https://blog.kuzudb.com/post/factorization/ - Kùzu's RDFGraphs feature https://docs.kuzudb.com/rdf-graphs/ =============================================================================== For discount on the below courses: Appsync: https://appsyncmasterclass.com/?affiliateId=41c07a65-24c8-4499-af3c-b853a3495003 Testing serverless: https://testserverlessapps.com/?affiliateId=41c07a65-24c8-4499-af3c-b853a3495003 Production-Ready Serverless: https://productionreadyserverless.com/?affiliateId=41c07a65-24c8-4499-af3c-b853a3495003 Use the button, Add Discount and enter "geeknarrator" discount code to get 20% discount. =============================================================================== Follow me on Linkedin and Twitter: https://www.linkedin.com/in/kaivalyaapte/ and https://twitter.com/thegeeknarrator If you like this episode, please hit the like button and share it with your network. Also please subscribe if you haven't yet. Database internals series: https://youtu.be/yV_Zp0Mi3xs Popular playlists: Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA- Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17 Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_d Modern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsN Stay Curios! Keep Learning!