

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.
Episodes
Mentioned books

8 snips
Jul 10, 2023 • 38min
Privacy vs Fairness in Computer Vision with Alice Xiang - #637
Alice Xiang, a Lead Research Scientist at Sony AI and Global Head of AI Ethics at Sony Group Corporation, shares her expertise on the critical issues of privacy and fairness in computer vision. She discusses the impact of data privacy laws and the dangers of unauthorized data use, emphasizing the importance of ethical practices in AI. Alice highlights the history of unethical data collection and the challenges posed by generative technologies. Solutions such as community engagement and interdisciplinary collaboration are also explored, alongside the need for robust AI regulation.

59 snips
Jul 3, 2023 • 48min
Unifying Vision and Language Models with Mohit Bansal - #636
In this engaging discussion, Mohit Bansal, a Parker Professor and Director of the MURGe-Lab at UNC, dives into the unification of vision and language models. He highlights the benefits of shared knowledge in AI, introducing innovative models like UDOP and VL-T5 that achieve top results with fewer parameters. The conversation also tackles the challenges of evaluating generative AI, addressing biases and the importance of data efficiency. Mohit shares insights on balancing advancements in multimodal models with responsible usage and the future of explainability in AI.

Jun 26, 2023 • 53min
Data Augmentation and Optimized Architectures for Computer Vision with Fatih Porikli - #635
Fatih Porikli, Senior Director of Technology at Qualcomm AI Research, shares insights from over 30 years in computer vision. He explores cutting-edge topics such as data augmentation techniques, optimized architectures, and advances in optical flow for video analysis. The conversation delves into the use of language models for fine-grained labeling, enhancing 3D object detection, and the role of generative AI in model efficiency. Fatih also discusses training neural networks and innovative approaches to integrating various data sources for improved accuracy.

38 snips
Jun 19, 2023 • 57min
Mojo: A Supercharged Python for AI with Chris Lattner - #634
In a captivating discussion, Chris Lattner, co-founder and CEO of Modular AI and creator of the Swift programming language, dives into Mojo, a groundbreaking programming language designed for AI developers. He explains how Mojo bridges the gap between Python's ease of use and C++'s performance, tackling the limitations posed by Python, particularly the global interpreter lock. Lattner emphasizes Mojo's compatibility with existing Python libraries, its potential to enhance AI workflows, and the need for a unified approach in AI model deployment.

10 snips
Jun 12, 2023 • 40min
Stable Diffusion and LLMs at the Edge with Jilei Hou - #633
Jilei Hou, VP of Engineering at Qualcomm Technologies, specializes in information theory and signal processing. He discusses the rise of generative AI and the advancement of deploying these models on edge devices. Challenges like model size and inference latency are highlighted, alongside solutions like quantization for optimizing performance. The conversation also dives into local optimization techniques that drastically reduce computation times for diffusion models. Jilei emphasizes the need for multimodal models, reshaping AI interactions and future innovations.

22 snips
Jun 5, 2023 • 47min
Modeling Human Behavior with Generative Agents with Joon Sung Park - #632
Joon Sung Park, a PhD student at Stanford University, is passionate about creating AI systems that address human challenges. He discusses his groundbreaking work on generative agents that mimic believable human behavior, emphasizing the role of context in AI interactions. The conversation delves into the complexities of long-term memory in agents and the significance of knowledge graphs for information retrieval. Joon also challenges traditional views on AI's worldview, exploring how emergent behaviors can reshape human-computer interaction.

6 snips
May 29, 2023 • 39min
Towards Improved Transfer Learning with Hugo Larochelle - #631
Hugo Larochelle, a research scientist at Google DeepMind, shares his groundbreaking work on transfer learning and neural knowledge mobilization. He dives into the significance of pre-training and fine-tuning in AI models, discussing the challenges and innovations in applying these techniques across diverse fields. Hugo also enlightens listeners on context-aware code generation and the evolution of large language models, revealing how they enhance code completion. Additionally, he sheds light on the creation of the Transactions on Machine Learning Research journal, advocating for more rigorous and open scientific publishing.

10 snips
May 22, 2023 • 28min
Language Modeling With State Space Models with Dan Fu - #630
Join Dan Fu, a PhD student at Stanford, as he dives into the evolving landscape of language modeling. He discusses the limitations of state space models and explores innovative techniques like Flash Attention, which enhances memory efficiency for processing longer sequences. Dan also shares insights on using synthetic languages to improve models and the quest for alternatives that outperform current attention-based methods. His research promises exciting advancements for the future of AI in understanding language.

4 snips
May 15, 2023 • 43min
Building Maps and Spatial Awareness in Blind AI Agents with Dhruv Batra - #629
Dhruv Batra, an associate professor at Georgia Tech and research director at Meta's FAIR team, shares groundbreaking insights on blind navigation agents. He discusses the emergence of maps within these agents and the importance of the embodiment hypothesis for true intelligence. The conversation explores the distinctions between cognitive and robotic mapping, innovations in AI's navigational capabilities using multilayer LSTMs, and the crucial role of memory in spatial awareness. Batra emphasizes the need for responsible data usage and the fascinating evolution of AI methodologies in navigation.

77 snips
May 8, 2023 • 41min
AI Agents and Data Integration with GPT and LLaMa with Jerry Liu - #628
Join Jerry Liu, co-founder and CEO of Llama Index, as he discusses the innovative creation of this platform that links external data with large language models. He shares insights on the challenges of integrating private data, the importance of automation in decision-making, and the evolution of AI agents. Liu also dives into strategies for optimizing complex queries and highlights the transformative potential of AI in processing unstructured data. Get ready to explore how technology can revolutionize data management!


