Wes McKinney on Improving the Data Stack & Composable Systems

The Real Python Podcast

Exploring DuckDB, Apache Arrow Integration, and pandas 2.0 Development

This chapter covers DuckDB, Apache Arrow integration, and the development of pandas 2.0. It highlights DuckDB's embeddable nature, the collaboration with Apache Arrow on a native interchange format, and the introduction of the Polars DataFrame library. The conversation also covers challenges in NumPy and pandas, focusing on memory usage improvements, changes to core data types, and the motivations behind the extension array subsystem.
