5min chapter

The Data Stack Show cover image

70: The Difference Between Data Lakes and Data Warehouses with Vinoth Chandar of Apache Hudi

The Data Stack Show

CHAPTER

Stream Processing and Databases in the Data Lake

We borrowed a lot from all the databases as well. Like indexes for example, we have an interesting problem for CDC right. If you so okay you have an upstream like Oracle or Cassandra some oil DP databases taking rights. And I invested a lot and more sort of like so this problem is similar to running a flink job reading from like a Kafka CDC and then updating a state show essentially stream processing principle. We are trying to add more capabilities on the lake than even a typical warehouse today.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode