Adding Statistics to Hadup Data Warehouses

When hadup first came on to the scene, you weren't really indexing all the data in the lake. The advent of things like parque and o r c for adding columner storage in the data lake has improved scanning efficiency. And we've done work on the formet itself, so you are able to read byke full compressed data. You need to scale tat daabas that datbes serves you pors, streaming tors in lar and upsover. We've built a the capita kevari store for ditalect. So asivly create an index at store its data on object storage like astre, and then you are loading the index in

Play episode from 18:25

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

Adding Statistics to Hadup Data Warehouses

Summary

Announcements

Interview

Contact Info

Parting Question

Closing Announcements

Links

The AI-powered Podcast Player