
Speeding Up Your DataFrames With Polars

The Real Python Podcast


Is It Possible to Use Arrow in Multi-Threaded Scenarios?

The key idea there is that it's really not a file format. It's more of a format for how data is going to be held in memory. Yeah. So it's setting up sensible things like, as I said, how numerical data is stored so that it reads through the cache very efficiently. It also has a more sane, unified plan for handling missing data. Okay. Is that different from how pandas would handle it normally? Good question. Yeah, because one of the challenges with pandas is that missing data can be a bit different depending on the type of the column. Whereas in Arrow, missing data is just null for everything.
