
523: Open-Source Analytical Computing (pandas, Apache Arrow)
Super Data Science: ML & AI Podcast with Jon Krohn
00:00
Challenges of Notebook Organization and Potential Future Support for Nested Columns in pandas
This chapter explores the difficulties in managing physical notebooks and the potential for pandas to support nested columns like Parquet and BigQuery. The conversation delves into the efficient handling of nested data by Apache Arrow, the importance of extension types in pandas, and the option of funding these advancements through Numbfocus donations.
Transcript
Play full episode