Super Data Science: ML & AI Podcast with Jon Krohn cover image

815: Polars: Faster DataFrame Ops, with Marco Gorelli

Super Data Science: ML & AI Podcast with Jon Krohn

NOTE

Extend and Optimize with Polars

Polars is a lightweight data processing tool that doesn't require heavy dependencies, such as PyArrow, pandas, or NumPy. Its ability to extend functionality through custom Rust plugins allows for tailored solutions that integrate seamlessly with existing workflows. This flexibility enables users to add specific plugins or create their own, which can be distributed via PyPI for easy installation. Additionally, the efficiency of lazy execution in Polars allows for managing data constraints effectively, enabling queries to be run on massive datasets without exceeding memory limits by only reading necessary rows. This combination of extensibility and optimization makes Polars a powerful choice for data processing tasks in environments like AWS Lambda.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner