
What Does Apache Arrow Unlock for Analytics? (w/ Wes McKinney)
The Analytics Engineering Podcast
00:00
Why Data Work Is Slow?
There's a couple main components to why stuff is slow, like why data work is slow. I think there are two big ones: disk access reading, writing from disk and maybe important in this context is like converting between memory formats. And so let's say that you take the earlier two categories and you like zero them out, they're like gone. You still have to fight with the network, right? And then you're fighting with the laws of physics. So if arrow is taking a big chunk out of this other section of things, is it only a small percentage? It can be a big part.
Transcript
Play full episode