Data Engineering Podcast cover image

Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

00:00

Adding Statistics to Hadup Data Warehouses

When hadup first came on to the scene, you weren't really indexing all the data in the lake. The advent of things like parque and o r c for adding columner storage in the data lake has improved scanning efficiency. And we've done work on the formet itself, so you are able to read byke full compressed data. You need to scale tat daabas that datbes serves you pors, streaming tors in lar and upsover. We've built a the capita kevari store for ditalect. So asivly create an index at store its data on object storage like astre, and then you are loading the index in

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app