Data Engineering Podcast cover image

Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff

Data Engineering Podcast

00:00

Is There a Hashing Algorithm?

M d is one of the faster hashing algorithms. It also has of the very real potential of generating hash collisions. In reality, i would love to use something like x x hash, right? Which is about a hundred times faster and can be simd than m d five. The reason why we're using emty five is just because is ubiquitous,. I think in reality, what we might do at some point is to not even pash it, but literally just try to som if their integers are floating points. Because for data verification, that's usually sufficient.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner