
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact. Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy. We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Latest episodes

Mar 21, 2025 • 12min
872: Microsoft’s “Majorana 1” Chip Brings Quantum ML Closer
In this five-minute Friday, Jon Krohn looks into Microsoft’s recent release of Majorana 1, a new quantum processing unit that uses topological qubits, a step away from the fragile qubits currently in use. Get Jon’s thoughts about this “transistor for the quantum age”, potential applications for quantum computing, and why this marks an exciting future for data science and machine learning practitioners. Additional materials: www.superdatascience.com/872 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Mar 18, 2025 • 1h 13min
871: NoSQL Is Ideal for AI Applications, with MongoDB’s Richmond Alake
Agentic AI, AI success strategies, and why flexibility will be so important to keep up with the AI market: Jon Krohn talks to Richmond Alake about the NoSQL database MongoDB, including why it’s a great addition to your toolkit for developing (agentic) AI applications, with a look under the hood at its native vector database. Richmond also talks about why he expects multi-agent AI architectures to go mainstream in 2025. Additional materials: www.superdatascience.com/871 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn:
(04:10) How Richmond became a Staff Developer Advocate
(07:40) How a NoSQL database differs from a relational database
(16:50) The advantages of working with the cloud-based MongoDB Atlas
(32:26) Richmond’s predictions for agentic AI
(40:38) How to create an effective AI strategy

Mar 14, 2025 • 17min
870: OpenAI’s “Deep Research”: Get Days of Human Work Done in Minutes
In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews of limitless topics, what similar tools are on the market, and where Jon sees the tool as having real-world value, including how he uses it daily. Additional materials: www.superdatascience.com/870 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Mar 11, 2025 • 1h 20min
869: AI Should Make Humans Wiser (But It Isn’t), with Varun Godbole
Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI. Additional materials: www.superdatascience.com/869 This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn:
(10:44) Using deep learning to predict breast cancer
(15:55) All about Varun’s Tuning Playbook
(29:56) On the explosion of interest and news about AI and data science
(46:35) About Varun’s Wise AI

Mar 7, 2025 • 27min
868: In Case You Missed It in February 2025
How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”. Additional materials: www.superdatascience.com/868 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Mar 4, 2025 • 1h 33min
867: LLMs and Agents Are Overhyped, with Dr. Andriy Burkov
The realities of Agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Models Book. Additional materials: www.superdatascience.com/867 This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn:
(07:38) Andriy’s “trilogy” of books on machine learning
(29:32) On the limitations of AI agents
(41:12) On the prospect of artificial general intelligence (AGI)
(54:24) On developing a chatbot that doesn’t hallucinate
(01:10:07) On open-weight and open-source LLMs

Feb 28, 2025 • 8min
866: Bringing Back Extinct Animals like the Woolly Mammoth and Dodo Bird
Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and woolly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction. Additional materials: www.superdatascience.com/866 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Feb 25, 2025 • 54min
865: How to Grow (and Sell) a Data Science Consultancy, with Cal Al-Dhubaib
Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers: The first is that they too often enjoy being the gatekeepers of their work rather than educating their clients and coworkers about the details of their projects and why they benefit the company. The second is that tech professionals don’t show vulnerability, whether that means not knowing a topic or not fully understanding how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn:
(09:32) How to scale a successful data science consultancy
(22:25) How Pandata navigates highly regulated environments
(27:59) How to tackle tech illiteracy in business
(36:32) What skills Cal looks for in new hires
(35:56) How to sell a tech company
Additional materials: www.superdatascience.com/865

Feb 21, 2025 • 8min
864: OpenAI’s o3-mini: SOTA reasoning and exponentially cheaper
Jon Krohn investigates OpenAI’s new release, o3-mini, in this five-minute Friday, where he walks through the reasoning model’s capabilities and performance, cross-examining them against other major-league players, DeepSeek-R1, GPT-4o and Claude 3.5 Sonnet. Additional materials: www.superdatascience.com/864 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.

Feb 18, 2025 • 1h 6min
863: TabPFN: Deep Learning for Tabular Data (That Actually Works!), with Prof. Frank Hutter
Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as attested by a write-up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data. This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information. In this episode you will learn:
(05:57) All about the TabPFN architecture
(21:27) Use cases for Bayesian inference
(35:07) On getting published in Nature
(44:03) How TabPFN handles time series data
(51:52) All about Prior Labs
Additional materials: www.superdatascience.com/863