Super Data Science: ML & AI Podcast with Jon Krohn

Jon Krohn
undefined
May 6, 2025 • 1h 15min

885: Python Polars: The Definitive Guide, with Jeroen Janssens and Thijs Nieuwdorp

Jeroen Janssens and Thijs Nieuwdorp are data frame library Polars’ greatest advocates in this episode with Jon Krohn, where they discuss their book, Python Polars: The Definitive Guide, best practice for using Polars, why Pandas users are switching to Polars for data frame operations in Python, and how the library reduces memory usage and compute time up to 10x more than Pandas. Listen to the episode to be a part of an O’Reilly giveaway! Additional materials: ⁠www.superdatascience.com/885 This episode is brought to you by Trainium2, the latest AI chip from AWS, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:44) Why Jeroen and Thijs wrote Python Polars: The Definitive Guide   (21:54) Best practices in Polars  (25:55) Why Polars has so many users (34:32) The benefits of the Great Tables package (51:06) Jeroen and Thijs’ partnership with NVIDIA and Dell for Python Polars: The Definitive Guide
undefined
May 2, 2025 • 7min

884: Model Context Protocol (MCP) and Why Everyone’s Talking About It

Model Context Protocol (MCP) is Anthropic’s hottest tool, with over 1,000 community-built MCP servers in operation by February alone. In this Five-Minute Friday, Jon Krohn explains what took so long for users to catch on: Anthropic released MCP in November 2024. Hear more about the buzz behind MCP, its applications, and how easy it is to get started. Additional materials: ⁠www.superdatascience.com/884⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
undefined
Apr 29, 2025 • 1h 4min

883: Blackwell GPUs Are Now Available at Your Desk, with Sama Bali and Logan Lawler

Returning after the “Super Bowl of AI”, NVIDIA GTC, Sama Bali and Logan Lawler talk to Jon Krohn about their respective work at tech giants NVIDIA and Dell. Sama and Logan discuss the next-gen Blackwell GPUs to their collaboration with Dell in launching Pro-Max PCs specially designed to take on heavy computational workloads as well as the incredible performance of GB 10 and GB 300 workstations, and the widening accessibility of AI developer tools and models.  Additional materials: www.superdatascience.com/883 This episode is brought to you by ODSC, the Open Data Science Conference, by Adverity, the conversational analytics platform and by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn: (07:29) About Dell’s Pro Max PCs (14:01) Why having a Blackwell GPU from Nvidia is a great option for those new to training and deploying AI models   (36:47) When it makes sense for a data scientist to switch from a Unix to a Windows based system  (46:33) Logan’s and Sama’s predictions for AI
undefined
Apr 25, 2025 • 10min

882: 40x Hotter Than the Sun: The ASML Machines That Make AI Chips

This week’s five-minute Friday heads to the Netherlands to find out more about Dutch company ASML, the brains behind the lithography machines that build AI chips.  Jon Krohn walks through how ASML came to dominate the market, where they’re headed next, and how ASML’s complex machines shape AI chips as well as the very future of AI.  Additional materials: www.superdatascience.com/882Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
undefined
Apr 22, 2025 • 1h 17min

881: Beyond GPUs: The Power of Custom AI Accelerators, with Emily Webber

Emily Webber speaks to Jon Krohn about her work at Amazon Web Services, from its Annapurna Labs-developed Nitro System, a foundational technology that can enhance securities and performance in the cloud and how Trainium2 became AWS’ most powerful AI chip with four times the compute of Trainium. Hear the specs of AWS’s chips and when to use them.Additional materials: www.superdatascience.com/881This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn: (08:36) Emily’s work on AWS’ SageMaker and Trainium  (23:54) How AWS Neuron lets builders tailor their approach to using frameworks  (29:07) Why using an accelerator is better than using a GPU  (35:29) The key differences between AWS Trainium and AWS Trainium2  (52:45) How to select between AWS Trainium and AWS Trainium2
undefined
Apr 18, 2025 • 10min

880: Manus, DeepSeek and China’s AI Boom

First developed in China, Manus AI and DeepSeek have made great waves on an international scale. Sought-after for their cost-effectiveness compared to US-made tech, Manus AI and DeepSeek are quickly becoming dominant technologies inside the country. In this five-minute Friday, Jon Krohn asks: Do these technologies warrant the huge amount of resources spent on them by multiple industries in China, and what makes hype become a mainstay?Additional materials: www.superdatascience.com/880Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
undefined
Apr 15, 2025 • 1h 7min

879: Serverless, Parallel, and AI-Assisted: The Future of Data Science is Here, with Zerve’s Dr. Greg Michaelson

Greg Michaelson speaks to Jon Krohn about the latest developments at Zerve, an operating system for developing and delivering data and AI products, including a revolutionary feature allowing users to run multiple parts of a program’s code at once and without extra costs. You’ll also hear why LLMs might spell trouble for SaaS companies, Greg’s ‘good-cop, bad-cop’ routine that improves LLM responses, and how RAG (retrieval-augmented generation) can be deployed to create even more powerful AI applications.Additional materials: www.superdatascience.com/879This episode is brought to you by Trainium2, the latest AI chip from AWS and by the Dell AI Factory with NVIDIA.Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn: (04:00) Zerve’s latest features (35:26) How Zerve’s built-in API builder and GPU manager lowers barriers to entry (40:54) How to get started with Zerve (41:49) Will LLMs make SaaS companies redundant? (52:29) How to create fairer and more transparent AI systems (56:07) The future of software developer workflows
undefined
Apr 11, 2025 • 31min

878: In Case You Missed It in March 2025

AI stacks, AGI, training neural networks, and AI authenticity: Jon Krohn rounds up his interviews from March with this episode of “In Case You Missed It”. In his favorite clips from the month, he speaks to Andriy Burkov (Episode 867), Natalie Monbiot (Episode 873), Richmond Alake (Episode 871) and Varun Godbole (Episode 869). Additional materials: www.superdatascience.com/878Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
undefined
Apr 8, 2025 • 1h 10min

877: The Neural Processing Units Bringing AI to PCs, with Shirish Gupta

NPUs, AIPC, and Dell’s growing suite of AI products: Shirish Gupta speaks to Jon Krohn about neural processing units and what makes them a go-to tool for AI inference workloads, reasons to move your workloads from the cloud and to your local devices, what the mnemonic AIPC stands for and why it will soon be on everyone’s lips, and he offers a special intro to Dell’s new Pro-AI Studio Toolkit. Hear about several real-world AIPC applications run by Dell’s clients, from detecting manufacturing defects to improving efficiencies for first responders, massively supporting actual life-or-death situations. Additional materials: www.superdatascience.com/877This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn: (03:28) What neural processing units (NPUs) are (23:53) About Dell Pro AI Studio  (35:03) Use cases for Dell Pro AI Studio (45:16) How AI development workflows and applications will change  (49:01) About Dell’s AI factory ecosystem
undefined
10 snips
Apr 4, 2025 • 15min

876: Hugging Face’s smolagents: Agentic AI in Python Made Easy

Discover how Hugging Face's smolagents are revolutionizing the landscape of agentic AI with their simple yet powerful Python library. Learn about their user-friendly features and multi-step reasoning capabilities that can transform how we develop autonomous AI agents. The discussion also highlights various frameworks, including a comparison with LangChain and Microsoft Autogen, encouraging engagement and feedback from listeners. This is a deep dive into making AI more accessible and efficient for personal and professional use!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app