Super Data Science: ML & AI Podcast with Jon Krohn

Latest episodes

Mar 21, 2025 • 12min

872: Microsoft’s “Majorana 1” Chip Brings Quantum ML Closer

In this Five-Minute Friday, Jon Krohn looks into Microsoft’s recent release of Majorana 1, a new quantum processing unit that uses topological qubits, a step away from the fragile qubits currently in use. Get Jon’s thoughts about this “transistor for the quantum age”, potential applications for quantum computing, and why this marks an exciting future for data science and machine learning practitioners.
Additional materials: www.superdatascience.com/872
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Mar 18, 2025 • 1h 13min

871: NoSQL Is Ideal for AI Applications, with MongoDB’s Richmond Alake

Agentic AI, AI success strategies, and why flexibility will be so important to keep up with the AI market: Jon Krohn talks to Richmond Alake about the NoSQL database MongoDB, including why it’s a great addition to your toolkit for developing (agentic) AI applications, with a look under the hood at its native vector database. Richmond also talks about why he expects multi-agent AI architectures to go mainstream in 2025.
Additional materials: www.superdatascience.com/871
This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(04:10) How Richmond became a Staff Developer Advocate
(07:40) How a NoSQL database differs from a relational database
(16:50) The advantages of working with the cloud-based MongoDB Atlas
(32:26) Richmond’s predictions for agentic AI
(40:38) How to create an effective AI strategy
Mar 14, 2025 • 17min

870: OpenAI’s “Deep Research”: Get Days of Human Work Done in Minutes

In this Five-Minute Friday, Jon Krohn looks into what he considers the world’s most powerful research tool to date, OpenAI’s Deep Research. Find out how OpenAI trained Deep Research to compile literature reviews on a limitless range of topics, what similar tools are on the market, and where Jon sees the tool as having real-world value, including how he uses it daily.
Additional materials: www.superdatascience.com/870
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Mar 11, 2025 • 1h 20min

869: AI Should Make Humans Wiser (But It Isn’t), with Varun Godbole

Jon Krohn talks to Varun Godbole about AI prompt engineering, generative wisdom, and AI generalists in this episode all about the interrelationships between humans and AI.
Additional materials: www.superdatascience.com/869
This episode is brought to you by the Dell AI Factory with NVIDIA and by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(10:44) Using deep learning to predict breast cancer
(15:55) All about Varun’s Tuning Playbook
(29:56) On the explosion of interest and news about AI and data science
(46:35) About Varun’s Wise AI
Mar 7, 2025 • 27min

868: In Case You Missed It in February 2025

How to start a successful tech company, and how you can get started with DBT, TabPFN and BAML: Jon Krohn rounds up his favorite moments from February in this episode of “In Case You Missed It”.
Additional materials: www.superdatascience.com/868
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Mar 4, 2025 • 1h 33min

867: LLMs and Agents Are Overhyped, with Dr. Andriy Burkov

The realities of agentic AI, AGI, and chatbots that don’t hallucinate: Andriy Burkov talks to Jon Krohn about AI in 2025. Best known for his concise machine learning modelling books, author and AI influencer Andriy Burkov also talks about his latest publication in the series, The Hundred-Page Language Models Book.
Additional materials: www.superdatascience.com/867
This episode is brought to you by the Dell AI Factory with NVIDIA. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(07:38) Andriy’s “trilogy” of books on machine learning
(29:32) On the limitations of AI agents
(41:12) On the prospect of artificial general intelligence (AGI)
(54:24) On developing a chatbot that doesn’t hallucinate
(01:10:07) On open-weight and open-source LLMs
Feb 28, 2025 • 8min

866: Bringing Back Extinct Animals like the Woolly Mammoth and Dodo Bird

Jon Krohn addresses a question for the ages: How close are we, really, to Jurassic Park? Dallas-based biotech company Colossal Biosciences is developing technology that aims to return previously extinct animals like the dodo and woolly mammoth to earth and, crucially, pull many others like the white rhino back from the brink of extinction.
Additional materials: www.superdatascience.com/866
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Feb 25, 2025 • 54min

865: How to Grow (and Sell) a Data Science Consultancy, with Cal Al-Dhubaib

Jon Krohn talks to Cal Al-Dhubaib about the extraordinary success of AI and machine learning solutions provider Pandata, his ironclad hack for any company to define their core values, and how to attract and secure loyal clients. Cal thinks tech professionals make two critical mistakes in their careers. The first is that they too often enjoy being the gatekeepers of their work rather than educating their clients and coworkers about the details of their projects and how those projects benefit the company. The second is that tech professionals don’t show vulnerability, whether that means admitting they don’t know a topic or don’t fully understand how a business works. This issue, Cal says, can spell the difference between a startup’s success and failure. Learn how tech startups can make an ironclad strategy for their future in this episode of The SuperDataScience Podcast.
This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(09:32) How to scale a successful data science consultancy
(22:25) How Pandata navigates highly regulated environments
(27:59) How to tackle tech illiteracy in business
(36:32) What skills Cal looks for in new hires
(35:56) How to sell on a tech company
Additional materials: www.superdatascience.com/865
Feb 21, 2025 • 8min

864: OpenAI’s o3-mini: SOTA reasoning and exponentially cheaper

Jon Krohn investigates OpenAI’s new release, o3-mini, in this Five-Minute Friday, where he walks through the reasoning model’s capabilities and performance, comparing them against those of other major-league players: DeepSeek-R1, GPT-4o, and Claude 3.5 Sonnet.
Additional materials: www.superdatascience.com/864
Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
Feb 18, 2025 • 1h 6min

863: TabPFN: Deep Learning for Tabular Data (That Actually Works!), with Prof. Frank Hutter

Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained a stubborn obstacle. In this episode, Frank Hutter details the path he has found around this obstacle, even with limited data, by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write-up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data.
This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.
In this episode you will learn:
(05:57) All about the TabPFN architecture
(21:27) Use cases for Bayesian inference
(35:07) On getting published in Nature
(44:03) How TabPFN handles time series data
(51:52) All about Prior Labs
Additional materials: www.superdatascience.com/863
