Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

Adding Statistics to Hadoop Data Warehouses

When Hadoop first came onto the scene, you weren't really indexing all the data in the lake. The advent of things like Parquet and ORC for adding columnar storage in the data lake has improved scanning efficiency. And we've done work on the format itself, so you are able to read [unintelligible] compressed data. You need to scale that database that serves your [unintelligible] streaming [unintelligible] and upserts. We've built the capability of a key-value store for the data lake. So essentially you create an index and store its data on object storage like S3, and then you are loading the index in
