
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Latest episodes

Jun 14, 2022 • 1h 15min
583: The State of Natural Language Processing
In this episode, natural language processing (NLP) expert and Lead Data Scientist at CB Insights, Rongyao Huang, joins Jon Krohn to discuss NLP. Listen in for a thorough review of the field over the past decade and how the coming iron age of NLP will help us overcome the limitations of today's approaches.In this episode you will learn:
The evolution of NLP techniques over the past decade [4:14]
What's next in the coming iron age of NLP [35:33]
Rongyao’s Bauhaus-inspired model for effective data science [43:12]
Rongyao's long-term career pathfinding framework [51:50]
Rongyao’s top tips for staying sane while juggling career and family [1:00:30]
Additional materials: www.superdatascience.com/583

Jun 10, 2022 • 3min
582: Model Speed vs Model Accuracy
In this episode, Jon wraps up his three-part series on business value and machine learning. Listen in as he explains why starting with simple models is best, and why speed is likely more important to your users than accuracy.Additional materials: www.superdatascience.com/582

Jun 7, 2022 • 1h 25min
581: Bayesian, Frequentist, and Fiducial Statistics in Data Science
In this episode founding Editor-in-Chief of the Harvard Data Science Review and Professor of Statistics at Harvard University, Prof. Xiao-Li Meng, joins Jon Krohn to dive into data trade-offs that abound, and shares his view on the paradoxical downside of having lots of data.In this episode you will learn:
What the Harvard Data Science Review is and why Xiao-Li founded it [5:31]
The difference between data science and statistics [17:56]
The concept of 'data minding' [22:27]
The concept of 'data confession' [30:31]
Why there’s no “free lunch” with data, and the tricky trade-offs that abound [35:20]
The surprising paradoxical downside of having lots of data [43:23]
What the Bayesian, Frequentist, and Fiducial schools of statistics are, and when each of them is most useful in data science [55:47]
Additional materials: www.superdatascience.com/581

Jun 3, 2022 • 6min
580: Collecting Valuable Data
In this episode, Jon resumes his series on strategies for getting business value from machine learning. Part one saw him review several ways to identify a commercial problem before starting data collection or ML model development. And now, in part two, Jon digs into the data collection process.Additional materials: www.superdatascience.com/580

May 31, 2022 • 47min
579: Transforming Dentistry with A.I.
In this episode, the CEO of Overjet, Dr. Wardah Inam, joins Jon Krohn to discuss the classification and quantification of dental diagnoses with computer vision, her data labeling challenges, and tips for building a successful A.I. business.In this episode you will learn:
How Overjet leverages computer vision to qualify and quantify dental diagnoses [5:11]
How A.I. solutions reduce the under-diagnosis of common diseases like periodontal disease [8:15]
Overjet's particular ML challenges within the dental industry [15:45]
Wardah's experience in introducing A.I. to the dental industry [20:12]
Wardah's tips for building a successful A.I. business [23:34]
What she looks for in the data scientists and software engineers she hires [39:36]
Additional materials: www.superdatascience.com/579

May 27, 2022 • 4min
578: Identifying Commercial ML Problems
In this episode, Jon kicks off a new Five-Minute Friday series that explores the strategies for getting business value from machine learning. Part one sees him review several ways to identify a commercial problem before starting data collection or ML model development.Additional materials: www.superdatascience.com/578

May 24, 2022 • 55min
577: Scaling A.I. Startups Globally
In this episode, the former CEO and co-founder behind Onfido, an AI-based ID verification, joins Jon Krohn to discuss his path to start-up success. Tune in to hear valuable information from Husayn Kassai.In this episode you will learn:
How Husayn's start-up journey began [5:55]
How Husayn determined that his challenge could be solved by machine vision [11:18]
Onfido's initial seed stages [18:23]
Launching and scaling your start-up in the U.S. market [22:00]
The most important component in building the best product [26:30]
Husayn's latest start-up [28:52]
Husayn’s startup project decision-making process [37:49]
Choosing your co-founding team [44:04]
Additional materials: www.superdatascience.com/577

May 20, 2022 • 3min
576: Tech Startup Dramas
Hollywood has officially fallen for the drama of tech startups! Tune in to hear Jon Krohn review the small-screen adaptations of WeWork (WeCrashed), Uber (Super Pumped), and Theranos (The Dropout).Additional materials: www.superdatascience.com/576

May 17, 2022 • 1h 24min
575: Optimizing Computer Hardware with Deep Learning
In this episode, the Director of Architecture at NVIDIA, Dr. Magnus Ekman, joins Jon Krohn to discuss how machine learning, including deep learning, can optimize computer hardware design. The pair also review his exceptional book 'Learning Deep Learning.'In this episode you will learn:
What hardware architects do [10:15]
How ML can optimize hardware speed [ 13:19]
Magnus’s Deep Learning Book [21:14]
Is understanding how ML models work important? [36:16]
Algorithms inspired by biological evolution [41:25]
How artificial general intelligence won’t be obtained by increasing model parameters alone [51:24]
Why there will always be a place for CNNs and RNNs [54:51]
How people can "transition" realistically into ML [1:09:15]
Additional materials: www.superdatascience.com/575

May 13, 2022 • 4min
574: Music for Deep Work
In this episode, Jon shares how the right music can power your productivity. It's no secret that he's a big fan of 'deep work,' but this week, he opens up about the artists, sites, and playlists that propel his productivity to new levels.Additional materials: www.superdatascience.com/574