
Super Data Science: ML & AI Podcast with Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Latest episodes

Aug 4, 2023 • 11min
702: Llama 2 — It's Time to Upgrade your Open-Source LLM
This week, Jon Krohn is examining Meta's newly released open-source large language model, Llama 2, highlighting its commercial prospects, immense capacity, model variety, and unique 'time awareness' feature. He also discusses its innovative two-stage RLHF approach that enhances its performance.Additional materials: www.superdatascience.com/702Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Aug 1, 2023 • 1h 21min
701: Generative A.I. without the Privacy Risks (with Prof. Raluca Ada Popa)
Dr. Raluca Ada Popa, renowned computer scientist, entrepreneur, and President of Opaque Systems, joins Jon Krohn to share her insights on securely interacting with AI APIs like OpenAI's GPT-4, the pros and cons of open vs. closed-source AI development, and the seamless operation of compute pipelines across multiple clouds.This episode is brought to you by AWS Inferentia and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this episode you will learn:• What is a confidential computing platform? [04:31]• How to get started with confidential computing [12:10]• The challenges of confidential computing and LLMs [21:11]• How to safeguard your data while using commercial LLMs like GPT-4 [38:00]• Open-source vs closed-source [52:28]• Raluca's PreVail cybersecurity company [1:01:50]• Combining entrepreneurship and academic career [1:04:03]• DARE Program [1:10:39]Additional materials: www.superdatascience.com/701

Jul 28, 2023 • 5min
700: "The Dream of Life" by Alan Watts
Yoga and Hindu mythology: This special episode continues the thread of our centenary episodes, SDS 500: Yoga Nidra with Jes Allen and SDS 600: Yoga Nidra Practice with Steve Fazzari, which talked through guided meditation techniques to help improve posture, sleep, and expand consciousness. Inspired by these sessions, host Jon Krohn explores Hindu mythology via Alan Watts’ “The Dream of Life”.Additional materials: www.superdatascience.com/700Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 25, 2023 • 51min
699: The Modern Data Stack, with Harry Glaser
Model deployment, data warehouse options for running models, and how to best leverage BI tools: Harry Glaser and Jon Krohn discuss Modelbit’s capabilities to automate ML models from notebooks into production-ready models, reducing the time and effort in ‘translating’ information from one mode to another. Harry’s conversation with host Jon Krohn expanded on the importance of automating this task, and how developments in ML modeling have widened access to entire teams to analyze data, whatever their level of expertise.This episode is brought to you by the AWS Insiders Podcast. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this episode you will learn:• What the modern data stack is [03:28]• Version control for data scientists [13:30]• CI/CD, load balancing and logging [20:38]• Snowflake vs. Redshift [30:10]• How tools like Looker and Tableau help monitor models [35:26]Additional materials: www.superdatascience.com/699

Jul 21, 2023 • 28min
698: How Firms Can Actually Adopt A.I., with Rehgan Avon
Company-wide AI adoption can take a lot of persuasion. Rehgan Avon talks to host Jon Krohn about why AI has become necessary for forward-thinking businesses and the steps to implement AI in an institution so that everyone benefits.Additional materials: www.superdatascience.com/698Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 18, 2023 • 1h 27min
697: The (Short) Path to Artificial General Intelligence, with Dr. Ben Goertzel
AI visionary and CEO of SingularityNET Dr. Ben Goertzel provides a deep dive into the possible realization of Artificial General Intelligence (AGI) within 3-7 years. Explore the intriguing connections between self-awareness, consciousness, and the future of Artificial Super Intelligence (ASI) and discover the transformative societal changes that could arise.This episode is brought to you by AWS Inferentia, by the AWS Insiders Podcast, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this episode you will learn:• Decentralized and benevolent AGI [03:13] • The SingularityNET ecosystem [13:10]• Dr. Goertzel's vision for realizing AGI - combining DL with neuro-symbolic systems, genetic algorithms and knowledge graphs [25:50]• How reaching AGI will trigger Artificial Super Intelligence [38:51]• Dr. Goertzel's approach to AGI using OpenCog Hyperon [42:34]• Why Dr. Goertzel believes AGI will be positive for humankind [53:07]• How to ensure the AGI is benevolent [1:06:43]• How AGI or ASI may act ethically [1:13:50]Additional materials: www.superdatascience.com/697

Jul 14, 2023 • 1h 3min
696: Brain-Computer Interfaces and Neural Decoding, with Prof. Bob Knight
Jon Krohn welcomes Professor Dr. Bob Knight to explore human intelligence, the prefrontal cortex, and the transformative potential of brain implants for data collection. Discover the pivotal role of machine learning in treating Parkinson's and delve into exciting future advancements.Additional materials: www.superdatascience.com/696Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 11, 2023 • 1h 38min
695: NLP with Transformers, feat. Hugging Face's Lewis Tunstall
What are transformers in AI, and how do they help developers to run LLMs efficiently and accurately? This is a key question in this week’s episode, where Hugging Face’s ML Engineer Lewis Tunstall sits down with host Jon Krohn to discuss encoders and decoders, and the importance of continuing to foster democratic environments like GitHub for creating open-source models.This episode is brought to you by the AWS Insiders Podcast, by WithFeeling.ai, the company bringing humanity into AI, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this episode you will learn:• What a transformer is, and why it is so important for NLP [04:34]• Different types of transformers and how they vary [11:39]• Why it’s necessary to know how a transformer works [31:52]• Hugging Face’s role in the application of transformers [57:10]• Lewis Tunstall’s experience of working at Hugging Face [1:02:08]• How and where to start with Hugging Face libraries [1:18:27]• The necessity to democratize ML models in the future [1:25:25]Additional materials: www.superdatascience.com/695

Jul 7, 2023 • 8min
694: CatBoost: Powerful, efficient ML for large tabular datasets
Modeling tabular data and spreadsheets doesn’t have to be tedious with CatBoost’s open-source tree-boosting algorithm. CatBoost does what it says on the tin, blending categories with boosting that allows you to train your models faster and handle large datasets for ML tasks across multiple GPUs. In this week’s Five-Minute Friday, host Jon Krohn gets to grips with the technical components of CatBoost that give it the speed and accuracy so acclaimed by its users.Additional materials: www.superdatascience.com/694Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Jul 4, 2023 • 1h 20min
693: YOLO-NAS: The State of the Art in Machine Vision, with Harpreet Sahota
Harpreet Sahota, a data science expert and deep learning developer at Deci AI, joins Jon Krohn to explore the fascinating realm of object detection and the revolutionary YOLO-NAS model architecture. Discover how machine vision models have evolved and the techniques driving compute-efficient edge device applications..This episode is brought to you by AWS Inferentia, by WithFeeling.ai, the company bringing humanity into AI, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.In this episode you will learn:• What is machine vision? [07:02]• Object detection and YOLO architectures [13:00]• Deci's YOLO-NAS: Optimal object detection model architecture [23:39]• Developer Relations [1:00:16]• Harpreet's 'top-down' approach to learning Deep Learning [1:06:50]Additional materials: www.superdatascience.com/693