
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Latest episodes

Nov 1, 2024 • 9min
832: The Anthropic CEO’s Techno-Utopia
Host Jon Krohn unpacks Dario Amodei’s vision of a techno-utopia in his essay Machines of Loving Grace, where “Powerful AI” takes center stage. Amodei, CEO of Anthropic, imagines a future where AI doesn’t just assist but actively shapes fields like healthcare, economics, and governance with unmatched intelligence and autonomy. Jon explores the possibilities and challenges of this AI-driven future, asking how close we are to seeing these revolutionary shifts and what they mean for society.Additional materials: www.superdatascience.com/832Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Oct 29, 2024 • 1h 23min
831: PyTorch Lightning, Lit-Serve and Lightning Studios, with Dr. Luca Antiga
PyTorch Lightning is revolutionizing the AI landscape, and Dr. Luca Antiga, CTO of Lightning AI, joins host Jon Krohn to explain how. In this episode, they explore the tools pushing AI development forward, from Lightning Studios to Lit-Serve, and discuss the game-changing rise of small language models that challenge industry giants with precision and speed. Luca also shares his vision for developers in an AI-enhanced world, where coding meets creativity and collaboration with intelligent tools.This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
How Lightning AI's open-source tools make AI development faster [11:30]
The rise of small language models and how they'll rival LLMs [37:47]
Luca's journey from biomedical imaging to deep learning pioneer [52:03]
How AI will transform software developer tasks [1:03:05]
Additional materials: www.superdatascience.com/831

Oct 25, 2024 • 11min
830: The “A.I.” Nobel Prizes (in Physics and Chemistry??)
Geoffrey Hinton and Sir Demis Hassabis: The Nobel Prize committee is an achievement of the highest order, awarding physicists, chemists, physiologists, medical practitioners, writers, pacifists and economists perhaps the greatest honor in their respective fields. In this week’s Five-Minute Friday, Jon Krohn discusses how two AI pioneers came to win prizes in chemistry and physics.Additional materials: www.superdatascience.com/830Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Oct 22, 2024 • 1h 37min
829: Neuroscience Fueled by ML, with Prof. Bradley Voytek
Neuroscientist Bradley Voytek outlines to Jon Krohn the incredible use of data science and machine learning in his research and how recent discoveries in action potentials and neurons have completely skyrocketed the field to a new understanding of the brain and its functions. You’ll also hear what Bradley thinks is most important when hiring data scientists and his contributions to Uber’s algorithm when it was still a startup. This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
Breakthroughs in brain region communication [04:08]
The future of brain research and MedTech [35:24]
The libraries and software used at the Halicioglu Data Science Institute [45:11]
Brain rhythm as a diagnostic tool [1:02:58]
Bradley’s curriculum structure at UC San Diego [1:12:21]
How Uber applies data science [1:20:07]
Additional materials: www.superdatascience.com/829

Oct 18, 2024 • 20min
828: Are “Citizen Data Scientists” A Myth? With Keith McCormick
The citizen data scientist: Fact or fiction? Jon Krohn holds a conversation across episodes in this Five-Minute Friday, with today’s guest Keith McCormick, in part responding to Nick Elprin’s interview in episode 811: Scaling Data Teams Effectively.Additional materials: www.superdatascience.com/828Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Oct 15, 2024 • 1h 14min
827: Polars: Past, Present and Future, with Polars Creator Ritchie Vink
Ritchie Vink, CEO and Co-Founder of Polars, Inc., speaks to Jon Krohn about the new achievements of Polars, an open-source library for data manipulation. This is the episode for any data scientist on the fence about using Polars, as it explains how Polars managed to make such improvements, the APIs and integration libraries that make it so versatile, and what’s next for this efficient library.This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
Why Polars is so efficient [05:20]
Polars’ easy integration with other data-processing tools [21:23]
Eager vs lazy executive in Polars [32:15]
Polars’ data processing of large- and small-scale datasets [38:28]
Ritchie’s plans to scale his company [46:14]
Upcoming features in Polars [58:06]
Additional materials: www.superdatascience.com/827

Oct 11, 2024 • 42min
826: In Case You Missed It in September 2024
Next-gen IDEs, efficiency-boosting open-source Python libraries, and changes in hiring for data scientists: This episode of In Case You Missed It gives you our best clips of September’s interviews, hosted by Jon Krohn.Additional materials: www.superdatascience.com/826Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Oct 8, 2024 • 1h 2min
825: Data Contracts: The Key to Data Quality, with Chad Sanderson
Data contracts are redefining data quality and governance, and Chad Sanderson, CEO of Gable.ai, joins host Jon Krohn to explain how they can transform your data strategy. He breaks down what data contracts are, how they shift data quality checks closer to production, and why they’re essential for reducing data debt. Chad also highlights how better alignment between data producers and consumers can elevate data reliability and tackle change-management challenges in modern organizations.This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, and by Gurobi, the Decision Intelligence Leader. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:
What data contracts are and how they define expectations for data quality [03:16]
What data contracts look like [09:09]
The common misconceptions about data quality when implementing AI [12:55]
Chad’s Chief Operator role at Data Quality Camp [19:46]
How “shifting left” improves data reliability by addressing issues early [24:17]
Why data professionals still struggle with data quality [30:31]
How data debt forms and why it leads to complex, inefficient architectures [35:53]
How will the role of human oversight evolve in ensuring data quality? [47:12]
How can data teams leverage storytelling? [52:33]
Additional materials: www.superdatascience.com/825

Oct 4, 2024 • 14min
824: Llama 3.2: Open-Source Edge and Multimodal LLMs
Llama 3.2 brings a new era of AI innovation with lightweight models tailored for on-device applications and powerful vision models for handling complex image inputs. Host Jon Krohn explores how this release pushes the boundaries of open-source AI, making it more accessible and versatile for developers. He also covers the Llama Stack toolkit, designed to streamline deployment, and Llama Guard 3, Meta’s latest content moderation solution. With extensive support from major cloud and hardware partners, Llama 3.2 is set to unlock groundbreaking possibilities for AI across mobile and beyond. Tune in to hear more.Additional materials: www.superdatascience.com/824Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Oct 1, 2024 • 1h 21min
823: Virtual Humans and AI Clones, with Natalie Monbiot
Virtual humans are rewriting the rules of digital communication and reshaping entire industries. This week, Jon Krohn welcomes Natalie Monbiot, Head of Strategy at Hour One, to shed light on how AI avatars are revolutionizing L&D and e-commerce by turning traditional training and product listings into captivating, presenter-led content.This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick, by Gurobi, the Decision Intelligence Leader, and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.In this episode you will learn:• How do you create a virtual being? [10:55]• Reid Hoffman's avatar [13:40]• The virtual human economy [31:07]• Virtual human societies [51:24]• Virtual humans and creative expression [56:35]• Challenges in maintaining transparency [01:00:22]Additional materials: www.superdatascience.com/823