Data Engineering Podcast cover image

Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff

Data Engineering Podcast

00:00

Is There a Hashing Algorithm?

M d is one of the faster hashing algorithms. It also has of the very real potential of generating hash collisions. In reality, i would love to use something like x x hash, right? Which is about a hundred times faster and can be simd than m d five. The reason why we're using emty five is just because is ubiquitous,. I think in reality, what we might do at some point is to not even pash it, but literally just try to som if their integers are floating points. Because for data verification, that's usually sufficient.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app