Super Data Science: ML & AI Podcast with Jon Krohn cover image

Super Data Science: ML & AI Podcast with Jon Krohn

Latest episodes

undefined
Apr 2, 2024 • 1h 55min

771: Gradient Boosting: XGBoost, LightGBM and CatBoost, with Kirill Eremenko

Kirill Eremenko joins Jon Krohn for another exclusive, in-depth teaser for a new course just released on the SuperDataScience platform, “Machine Learning Level 2”. Kirill walks listeners through why decision trees and random forests are fruitful for businesses, and he offers hands-on walkthroughs for the three leading gradient-boosting algorithms today: XGBoost, LightGBM, and CatBoost.This episode is brought to you by Ready Tensor, where innovation meets reproducibility, and by Data Universe, the out-of-this-world data conference. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• All about decision trees [09:17]• All about ensemble models [21:43]• All about AdaBoost [36:47]• All about gradient boosting [45:52]• Gradient boosting for classification problems [59:54]• Advantages of XGBoost [1:03:51]• LightGBM [1:17:06]• CatBoost [1:32:07]Additional materials: www.superdatascience.com/771
undefined
Mar 29, 2024 • 45min

770: The Neuroscientific Guide to Confidence

Explore the science of confidence with Lucy Antrobus, as she unveils neuroscience-backed strategies to build and boost confidence through practice, positive energy, and the power of laughter. An essential listen for fostering unshakable self-assurance.Additional materials: www.superdatascience.com/770Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
undefined
Mar 26, 2024 • 1h 49min

769: Generative AI for Medicine, with Prof. Zack Lipton

Generative AI in medicine takes center stage as Prof. Zachary Lipton, Chief Scientific Officer at Abridge, joins host Jon Krohn to discuss the significant advancements in AI that are reshaping healthcare.This episode is brought to you by the DataConnect Conference, and by Data Universe, the out-of-this-world data conference. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• The inspiration for Zack to get started in ML and healthcare [03:56]• The hardware required to use Abridge [12:29]• The key data science projects at Abridge right now [35:05]• Abridge's tech stack [59:54]• How Abridge ensures reliability in a high-stakes setting like healthcare [1:07:29]• How Zack’s academic research cross-pollinates with his commercial ML projects [1:21:05]• How Zack’s jazz background molded his entrepreneur and data science journey [1:30:32]Additional materials: www.superdatascience.com/769
undefined
Mar 22, 2024 • 13min

768: Is Claude 3 Better than GPT-4?

Claude 3, LLMs and testing ML performance: Jon Krohn tests out Anthropic’s new model family, Claude 3, which includes the Haiku, Sonnet and Opus models (written in order of their performance power, from least to greatest). Can it stand shoulder to shoulder with other models such as GPT-4 and Gemini 1.0 Ultra? And how important is it for machine learning practitioners to try out these models with their own benchmarks? Jon walks listeners through a test of his own in this Five-Minute Friday.Additional materials: www.superdatascience.com/768Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
undefined
Mar 19, 2024 • 1h 48min

767: Open-Source LLM Libraries and Techniques, with Dr. Sebastian Raschka

Jon Krohn sits down with Sebastian Raschka to discuss his latest book, Machine Learning Q and AI, the open-source libraries developed by Lightning AI, how to exploit the greatest opportunities for LLM development, and what’s on the horizon for LLMs.This episode is brought to you by the DataConnect Conference, and by Data Universe, the out-of-this-world data conference. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• All about Machine Learning Q and AI [04:13]• Sebastian Raschka’s role as Staff Research Engineer at Lightning AI [19:21]• PyTorch Lightning’s and Lightning Fabric’s capabilities [39:32]• Large language models: Opportunities and challenges [43:35]• DoRA vs LoRA [48:56]• How to be a successful AI educator [1:34:18]Additional materials: www.superdatascience.com/767
undefined
Mar 15, 2024 • 8min

766: Vonnegut's Player Piano (1952): An Eerie Novel on the Current AI Revolution

Kurt Vonnegut's "Player Piano" delivers striking parallels between its dystopian vision and today's AI challenges. This week, Jon Krohn explores the novel's depiction of a world where humans are marginalized by machines, reflecting on the impact of automation on society and the ethical considerations it raises. Tune in as we unpack the timeless relevance of Vonnegut's work to the AI era.Additional materials: www.superdatascience.com/766Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
undefined
Mar 12, 2024 • 1h 37min

765: NumPy, SciPy and the Economics of Open-Source, with Dr. Travis Oliphant

Explore the origins of NumPy and SciPy with their creator, Dr. Travis Oliphant. Discover the journey from personal need to global impact, the challenges overcome, and the future of these essential Python libraries in scientific computing and data science.This episode is brought to you by the DataConnect Conference, by Data Universe, the out-of-this-world data conference, and by CloudWolf, the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• Travis's journey to creating NumPy and SciPy [08:05]• How Anaconda got started [42:24]• How Numba, a high-performance Python compiler, was brought to market [54:48]• Python's influence on the thought processes of scientists and engineers [1:04:21]• The commercial projects that support Travis’s vast open-source efforts and communities [1:10:22]• How to get involved in Travis's commercial projects and communities [1:22:34]• The future of scientific computing and Python libraries [1:29:50]Additional materials: www.superdatascience.com/765
undefined
Mar 8, 2024 • 8min

764: The Top 10 Episodes of 2023

Data science futurists, bestselling authors, and lively how-to guides from the industry’s top practitioners, which range from applying data science for good to using open-source tools for NLP: This is The Super Data Science Podcast’s top ten most listened-to episodes in 2023, hosted by Jon Krohn. A great snapshot of our great content from 2023.Additional materials: www.superdatascience.com/764Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
undefined
Mar 5, 2024 • 1h 27min

763: The Best A.I. Startup Opportunities, with venture capitalist Rudina Seseri

At Glasswing Ventures, Rudina Seseri wants to be able to answer the question: What has Glasswing Ventures done for the company beyond capital investment? She speaks to Jon Krohn about how her company uses data to assess venture capital investments, the secret sauce of successful AI startups, and why she feels generative AI is only the start of a much broader impact that AI will make in communities and businesses.This episode is brought to you by the DataConnect Conference, and by Ready Tensor, where innovation meets reproducibility. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• Potential interest areas for Series A AI venture capitalists [12:22]• How Glasswing’s AI Palette helps AI startups [23:06]• How data driven the venture capital industry is [27:21]• Advice for adopting services from AI providers [47:21]• Model collapse: Causes and concerns [58:44]• Glasswing’s checklist for AI startups [1:04:59]Additional materials: www.superdatascience.com/763
undefined
Mar 1, 2024 • 17min

762: Gemini 1.5 Pro, the Million-Token-Context LLM

Jon Krohn presents an insightful overview of Google's groundbreaking Gemini Pro 1.5, a million-token LLM that's transforming the landscape of AI. Discover the innovative aspects of Gemini Pro 1.5, from its extensive context window to its multimodal functionalities, which are broadening the scope of AI technology and signifying a significant leap in data science. Plus, join Jon for a practical demonstration, showcasing the real-world applications, capabilities, and limitation of this advanced language model.Additional materials: www.superdatascience.com/762Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode