Monday Morning Data Chat cover image

Monday Morning Data Chat

#150 - Nadine Farah - Apache Hudi Deep Dive

Nov 6, 2023
58:06
Snipd AI
Nadine Farah discusses Apache Hudi's core primitives like indexing and incremental processing. The podcast explores Hudi's role in data management, compliance, and interoperability. It also touches on Hootie ecosystem advancements and upcoming open source data summit talks.
Read more

Podcast summary created with Snipd AI

Quick takeaways

  • Apache Hudi offers table services for streamlined data management processes, ensuring data integrity.
  • Hoodie uniquely identifies each record via primary keys, maintaining partition-level uniqueness for fast updates and deletes.

Deep dives

Hoodie's Evolution and Purpose

Hoodie, an open-source project, emerged to aid Uber in scaling its analytics to manage petabyte data and handle real-time, large datasets. The project evolved to enhance analytical capabilities and manage streams of data efficiently, addressing challenges faced by companies like Uber in maintaining scalable data pipelines.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode