AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Separating Query Engine from Data Layer in Databases
A fascinating and transformative idea is to separate the query engine from the data layer in databases, like what Iceberg did in the Hadoop space. Tabular aims to build a business around storing and managing data to make it easy to plug in any engine. Similar to TileDB, the focus is on making the data the important factor while intelligent functions happen in the data layer. Surprising use cases of the Iceberg format include using a large elastic search cluster for a single identifier lookup.
Cloud data warehouses have unlocked a massive amount of innovation and investment in data applications, but they are still inherently limiting. Because of their complete ownership of your data they constrain the possibilities of what data you can store and how it can be used. Projects like Apache Iceberg provide a viable alternative in the form of data lakehouses that provide the scalability and flexibility of data lakes, combined with the ease of use and performance of data warehouses. Ryan Blue helped create the Iceberg project, and in this episode he rejoins the show to discuss how it has evolved and what he is doing in his new business Tabular to make it even easier to implement and maintain.
The intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA
Sponsored By:
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode