Data Engineering Podcast cover image

Supporting And Expanding The Arrow Ecosystem For Fast And Efficient Data Processing At Voltron Data

Data Engineering Podcast

00:00

Arrow Data Structures

Arrow is very optimized for tabular data, which is a substantial portion of what people are trying to perform analysis on. But with the growth of machine learning and more scalable and capable compute frameworks, there has been an increase in usage of other formats of data such as binaries or images or videos. And so one thing that we've seen is embedding unstructured data in Arrow data structures. It's not going to be a fit for 100% of use cases. Like not everything is a table.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app