This hands-on guide by Jeroen Janssens and Thijs Nieuwdorp provides complete coverage of Polars' capabilities for real-world data tasks. It teaches data loading from multiple sources (CSV, Parquet, databases), expression building, handling complex data types, and visualization with tools like Altair and plotnine. The book includes practical examples with real datasets available on GitHub, and covers advanced topics like GPU acceleration and Rust plugin integration.
This thoroughly revised guide demonstrates how the flexibility of the command line can help data scientists become more efficient and productive. It teaches how to combine Unix power tools to quickly obtain, scrub, explore, and model data, even for those comfortable with Python or R. The book includes a Docker image with over 100 Unix tools, emphasizing agility, scalability, and extensibility in data workflows.
Brian Kernighan's "Unix: A History and a Memoir" offers a personal and insightful account of the development of the Unix operating system. The book details the unique environment at Bell Labs that fostered innovation and collaboration, highlighting the contributions of key figures like Ken Thompson and Dennis Ritchie. Kernighan shares anecdotes and reflections on the design principles and cultural factors that shaped Unix's success. The book explores the evolution of Unix, its impact on the computing world, and its enduring legacy. It provides a valuable historical perspective on a pivotal technology and its lasting influence.
Deep Learning Illustrated is a comprehensive guide to deep learning, offering a unique visual and interactive approach. It explains deep learning techniques through straightforward analogies, vivid illustrations, and hands-on Python code in Jupyter notebooks. The book covers essential theory, including artificial neurons, training, optimization, convolutional nets, recurrent nets, generative adversarial networks (GANs), and deep reinforcement learning. It also focuses on practical applications such as machine vision, natural language processing, image generation, and game-playing algorithms, using libraries like Keras, TensorFlow, and PyTorch.
In 'Immune', Philipp Dettmer takes readers on a journey through the human body's defense mechanisms, detailing the intricate and constant battle against pathogens. The book delves into elements of the immune system such as antibodies, inflammation, and threats like bacteria, allergies, and cancer. Dettmer explains why boosting the immune system is often a misconception, how parasites and viruses operate, and what happens in the body during injuries. The book is enriched with full-color graphics and immersive descriptions, making immunology a gripping adventure. It also discusses the Covid-19 pandemic and the diversity of human immune responses.
Jeroen Janssens and Thijs Nieuwdorp are data frame library Polars’ greatest advocates in this episode with Jon Krohn, where they discuss their book, Python Polars: The Definitive Guide, best practices for using Polars, why Pandas users are switching to Polars for data frame operations in Python, and how the library can cut memory usage and compute time by up to 10x compared with Pandas. Listen to the episode to be a part of an O’Reilly giveaway!
Additional materials: www.superdatascience.com/885
This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform, and by the Dell AI Factory with NVIDIA.
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
- (07:44) Why Jeroen and Thijs wrote Python Polars: The Definitive Guide
- (21:54) Best practices in Polars
- (25:55) Why Polars has so many users
- (34:32) The benefits of the Great Tables package
- (51:06) Jeroen and Thijs’ partnership with NVIDIA and Dell for Python Polars: The Definitive Guide