

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.
Episodes
Mentioned books

20 snips
Jun 13, 2022 • 40min
Data Governance for Data Science with Adam Wood - #578
In this engaging conversation, Adam Wood, Director of Data Governance and Data Quality at Mastercard, shares his insights into the complexities of data governance on a global scale. He discusses the challenges of navigating regional regulations like GDPR and the importance of metadata management. Adam highlights the role of feature stores in data lineage and emphasizes the need to mitigate bias in data science. Moreover, he sheds light on investments in tools that enhance data quality and collaboration within data science teams.

Jun 6, 2022 • 46min
Feature Platforms for Data-Centric AI with Mike Del Balso - #577
In this engaging conversation, Mike Del Balso, Co-founder and CEO of Tecton, shares insights from his experience building machine learning platforms at Uber. He discusses the evolution of data infrastructure, highlighting the shift to cloud-based systems and the importance of feature platforms. Del Balso elaborates on the ML Flywheel concept, emphasizing how data can supercharge machine learning. He also tackles the challenges of assembling effective ML teams, offering strategies to avoid common pitfalls and enhance collaboration across teams.

26 snips
May 30, 2022 • 51min
The Fallacy of "Ground Truth" with Shayan Mohanty - #576
Shayan Mohanty, CEO of Watchful and former Facebook systems architect, dives into the world of data-centric AI. He discusses the complexities of data labeling and the benefits of techniques like active learning and weak supervision to enhance efficiency. Shayan also explores the challenges organizations face with hand-labeled data and the biases that arise, emphasizing the need for a more integrated approach in machine learning operations. He shines a light on the critical mindset shift required to embrace this innovative strategy for better outcomes.

24 snips
May 23, 2022 • 48min
Principle-centric AI with Adrien Gaidon - #575
Adrien Gaidon, head of ML research at Toyota Research Institute, shares his insights on principle-centric AI and self-supervised learning. He presents an intriguing fourth perspective in the data-centric AI debate. The discussion covers innovative applications of synthetic data, particularly in autonomous vehicles, and the ethical challenges of machine learning. Adrien emphasizes integrating fundamental principles with data to foster advancements in AI, and how a curiosity-driven approach can enhance model robustness in high-stakes fields like healthcare.

24 snips
May 19, 2022 • 37min
Data Debt in Machine Learning with D. Sculley - #574
D. Sculley, a director on the Google Brain team known for his insights on technical debt in machine learning, dives into the evolving concept of data debt. He discusses the integral role data quality plays in data-centric AI and highlights common sources of data debt. The conversation touches on innovative strategies like causal inference graphs and stress testing for improving model robustness. Sculley also explores the community's proactive steps to mitigate these issues, emphasizing a shift towards more accountable data practices.

May 16, 2022 • 39min
AI for Enterprise Decisioning at Scale with Rob Walker - #573
Rob Walker, VP at Pegasystems, returns to share his expertise in AI and machine learning for customer engagement. He tackles the 'next best' decision-making dilemma and distinguishes it from recommender systems. The conversation dives into machine learning's coexistence with heuristic methods and tackles challenges around responsible AI practices. Rob also discusses the significance of feature stores and the balance between traditional models and deep learning, all while gearing up for the upcoming PegaWorld conference.

17 snips
May 12, 2022 • 42min
Data Rights, Quantification and Governance for Ethical AI with Margaret Mitchell - #572
Meg Mitchell, Chief Ethics Scientist at Hugging Face, dives into the crucial interplay between ethical AI and data governance. She discusses her transition from big tech to prioritizing coding in her current role, emphasizing the importance of diverse data representation. Meg highlights evolving data curation practices, ethical documentation through Model Cards, and the pressing need for transparency to mitigate biases in AI. The conversation also touches on challenges in distinguishing AI-generated content from human-written material, raising concerns about misinformation.

11 snips
May 9, 2022 • 53min
Studying Machine Intelligence with Been Kim - #571
Been Kim, a staff research scientist at Google Brain and ICLR 2022 speaker, dives into the fascinating world of AI interpretability. She discusses the current state of interpretability techniques, exploring how Gestalt principles can enhance our understanding of neural networks. Been proposes a novel language for human-AI communication, aimed at improving collaboration and transparency. The conversation also touches on the evolution of AI tools, the unique insights from AlphaZero in chess, and the implications of model fingerprints for data privacy.

May 2, 2022 • 38min
Advances in Neural Compression with Auke Wiggers - #570
Auke Wiggers, an AI research scientist at Qualcomm, dives into the exciting realm of neural data compression. He discusses how generative models and transformer architectures are revolutionizing image and video coding. The conversation highlights the shift from traditional techniques to neural codecs that learn from examples, and the impressive real-time performance on mobile devices. Auke also touches on innovations like transformer-based transform coding and shares insights from recent ICLR papers, showcasing the future of efficient data compression.

10 snips
Apr 25, 2022 • 46min
Mixture-of-Experts and Trends in Large-Scale Language Modeling with Irwan Bello - #569
Irwan Bello, a research scientist formerly with Google Brain and now part of a stealth AI startup, dives into the world of sparse expert models. He discusses his recent work on designing effective architectures that improve performance while managing computational costs. The conversation uncovers how the mixture-of-experts technique can extend beyond NLP to various tasks, including vision. Bello also shares insights on enhancing alignment in language models through instruction tuning and the challenges of optimizing these large-scale systems.


