

Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast. As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.
Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.
We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
Episodes

Sep 26, 2023 • 1h 21min
717: Overcoming Adversaries with A.I. for Cybersecurity, with Dr. Dan Shiebler
Dr. Dan Shiebler, Head of ML at Abnormal Security, joins Jon Krohn this week and unveils the intricacies of cybercrime detection and email protection, and the role of AI in future challenges.
This episode is brought to you by Grafbase, the unified data layer, by ODSC, the Open Data Science Conference, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• The heuristic and “intermediate” ML models that they develop at Abnormal Security [07:08]
• How Dan uses LLMs at Abnormal Security [15:46]
• How false negatives are individually the biggest classification error to avoid in cybersecurity [20:49]
• How head-to-head competitor analysis helps refine models [34:34]
• Resilient ML in cybersecurity [38:36]
• Abnormal Security’s routine for updating their models [52:37]
• AI's impact on the urban world [1:09:57]
• How to stay updated in data science and AI [1:13:46]
Additional materials: www.superdatascience.com/717

Sep 22, 2023 • 14min
716: Happiness and Life-Fulfillment Hacks
Jon Krohn's 94-year-old grandmother, Annie, who's bursting with life and wisdom, shares her recipe for lifelong happiness and how relationships and daily intentions play an integral role. Annie also shares her curious take on modern technology. Get inspired by her infectious joy and perspective on life.
Additional materials: www.superdatascience.com/716
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Sep 19, 2023 • 1h 56min
715: Make Better Decisions with Data, with Dr. Allen Downey
Join us as Dr. Allen Downey, renowned author and professor, shares insights from his upcoming book 'Probably Overthinking It,' breaking down underused techniques like Survival Analysis, explaining common paradoxes, and discussing the dynamic Overton Window.
This episode is brought to you by the Zerve data science dev environment, by Modelbit, for deploying models in seconds, and by Grafbase, the unified data layer. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Why interpreting data is not always easy [06:21]
• What is Survival Analysis [15:32]
• Preston's Paradox [22:09]
• Are you Normal? [36:52]
• How to better prepare for rare “Black Swan” events [42:48]
• What is an Overton Window? [53:06]
• What is the base rate fallacy? [1:23:31]
• How to protect yourself from biased samples [1:33:39]
• Simpson’s Paradox [1:42:43]
Additional materials: www.superdatascience.com/715

Sep 15, 2023 • 37min
714: Using A.I. to Overcome Blindness and Thrive as a Data Scientist
In this Friday episode, guest Tim Albiges explores with host Jon Krohn how people with blindness can have a lucrative and fulfilling career in data science, how Tim’s PhD thesis applied machine learning to help diagnose chronic respiratory diseases, and the communication tools that blind people can use to live a full and independent life.
Additional materials: www.superdatascience.com/714
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Sep 12, 2023 • 1h 26min
713: Llama 2, Toolformer and BLOOM: Open-Source LLMs with Meta's Dr. Thomas Scialom
Artificial General Intelligence, RLHF’s application in AI, and how entrepreneurs can enter the AI industry: Meta’s AI Research Scientist Thomas Scialom gives us behind-the-scenes insights into developing Llama 2 and what’s in the works for Llama 3. With host Jon Krohn, he discusses the future of Artificial General Intelligence, why the Galactica science-focused LLM was taken down, and what he learned from it.
This episode is brought to you by AWS Inferentia, by Grafbase, the unified data layer, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Llama 2: Behind the Scenes of Today’s Top Open-Source LLM [05:04]
• Responsible use of Llama 2 [15:26]
• Toolformer: LLM That Learns How to Use External Tools [24:57]
• Galactica: The Science-Specific LLM and Why It Was Brought Down [36:57]
• Is AGI Around the Corner? [57:03]
• Advice for AI entrepreneurs [1:05:46]
• How Thomas develops and manages large-scale AI projects [1:14:42]
Additional materials: www.superdatascience.com/713

Sep 8, 2023 • 7min
712: Code Llama
Code Llama might just be starting the revolution for how data scientists code. In this Five-Minute Friday, host Jon Krohn investigates the suite of models under the free-to-use Code Llama and how to find the best fit for your project’s needs.
Additional materials: www.superdatascience.com/712
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Sep 5, 2023 • 1h 26min
711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain
In this episode, host Jon Krohn explores with his guest Ajay Jain, Co-Founder of Genmo.ai, how creative general intelligence could take the video industry by storm. They also discuss the models that got Genmo to this point, the applications of NeRF, and how understanding human psychology is so essential to developing models that output high-fidelity video.
This episode is brought to you by the Zerve data science dev environment, by Grafbase, the unified data layer, and by Modelbit, for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• About Genmo.ai and the term “creative general intelligence” [03:47]
• Why Ajay started Genmo.ai [09:26]
• The increased performance of multimodal models [21:12]
• All about Denoising Diffusion Probabilistic Models (DDPMs) [31:03]
• The application of Neural Radiance Fields (NeRF) [55:26]
• Predicting pedestrian behavior at Uber [1:01:50]
• How to save money in the process of training models [1:12:42]
Additional materials: www.superdatascience.com/711

Sep 1, 2023 • 1h 3min
710: LangChain: Create LLM Applications Easily in Python
Discover the power of Large Language Models with Kris Ograbek as he unravels the intricacies of LangChain and showcases a chatbot in action, all while putting our host Jon Krohn in the hot seat!
Additional materials: www.superdatascience.com/710
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

Aug 29, 2023 • 1h 21min
709: Big A.I. R&D Risks Reap Big Societal Rewards, with Meta's Dr. Laurens van der Maaten
Meta's Senior Research Director, Dr. Laurens van der Maaten, takes center stage to unravel the captivating realm of AI innovation. Learn about his groundbreaking contributions, including pioneering the t-SNE dimensionality reduction technique and harnessing AI for novel protein synthesis, climate change mitigation, and wearable materials simulation. Join us to explore the transformative power of AI across diverse domains and gain a glimpse into its future societal implications.
This episode is brought to you by AWS Inferentia, by Modelbit, for deploying models in seconds, and by Grafbase, the unified data layer. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• Large-scale learning of image recognition models on web data [05:05]
• Evolutionary Scale Modeling protein models [16:45]
• Fighting climate change by building an A.I. model [29:49]
• The CrypTen privacy-preserving ML framework [38:36]
• Concerns about adversarial examples [53:25]
• Laurens’ t-SNE algorithm [58:56]
• How to make a big impact [1:07:25]
Additional materials: www.superdatascience.com/709

Aug 25, 2023 • 23min
708: ChatGPT Code Interpreter: 5 Hacks for Data Scientists
On this week’s Five-Minute Friday, host Jon Krohn gives five reasons why he is so excited about ChatGPT’s Code Interpreter and walks listeners through its capabilities with a practical example.
Additional materials: www.superdatascience.com/708
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.