Data Engineering Podcast cover image

Let The Whole Team Participate In Data With The Quilt Versioned Data Hub

Data Engineering Podcast

00:00

How Do I Create a Top Hash for a Package?

The canonical location of the data is not only like a namespace, okay, a repository, which are an S3 bucket and then a package name. The default hash for S3 is MB5. So you can, you can develop collisions pretty easily. There's different ways to fingerprint data. But getting hashing consistency has been the biggest trick.

Play episode from 39:59
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app