The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Sam Charrington

Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.

Episodes

Mentioned books

Jul 10, 2023 • 38min

Privacy vs Fairness in Computer Vision with Alice Xiang - #637

Alice Xiang, a Lead Research Scientist at Sony AI and Global Head of AI Ethics at Sony Group Corporation, shares her expertise on the critical issues of privacy and fairness in computer vision. She discusses the impact of data privacy laws and the dangers of unauthorized data use, emphasizing the importance of ethical practices in AI. Alice highlights the history of unethical data collection and the challenges posed by generative technologies. Solutions such as community engagement and interdisciplinary collaboration are also explored, alongside the need for robust AI regulation.

Jul 3, 2023 • 48min

Unifying Vision and Language Models with Mohit Bansal - #636

In this engaging discussion, Mohit Bansal, a Parker Professor and Director of the MURGe-Lab at UNC, dives into the unification of vision and language models. He highlights the benefits of shared knowledge in AI, introducing innovative models like UDOP and VL-T5 that achieve top results with fewer parameters. The conversation also tackles the challenges of evaluating generative AI, addressing biases and the importance of data efficiency. Mohit shares insights on balancing advancements in multimodal models with responsible usage and the future of explainability in AI.

Jun 26, 2023 • 53min

Data Augmentation and Optimized Architectures for Computer Vision with Fatih Porikli - #635

Fatih Porikli, Senior Director of Technology at Qualcomm AI Research, shares insights from over 30 years in computer vision. He explores cutting-edge topics such as data augmentation techniques, optimized architectures, and advances in optical flow for video analysis. The conversation delves into the use of language models for fine-grained labeling, enhancing 3D object detection, and the role of generative AI in model efficiency. Fatih also discusses training neural networks and innovative approaches to integrating various data sources for improved accuracy.

Jun 19, 2023 • 57min

Mojo: A Supercharged Python for AI with Chris Lattner - #634

In a captivating discussion, Chris Lattner, co-founder and CEO of Modular AI and creator of the Swift programming language, dives into Mojo, a groundbreaking programming language designed for AI developers. He explains how Mojo bridges the gap between Python's ease of use and C++'s performance, tackling the limitations posed by Python, particularly the global interpreter lock. Lattner emphasizes Mojo's compatibility with existing Python libraries, its potential to enhance AI workflows, and the need for a unified approach in AI model deployment.

Jun 12, 2023 • 40min

Stable Diffusion and LLMs at the Edge with Jilei Hou - #633

Jilei Hou, VP of Engineering at Qualcomm Technologies, specializes in information theory and signal processing. He discusses the rise of generative AI and the advancement of deploying these models on edge devices. Challenges like model size and inference latency are highlighted, alongside solutions like quantization for optimizing performance. The conversation also dives into local optimization techniques that drastically reduce computation times for diffusion models. Jilei emphasizes the need for multimodal models, reshaping AI interactions and future innovations.

Jun 5, 2023 • 47min

Modeling Human Behavior with Generative Agents with Joon Sung Park - #632

Joon Sung Park, a PhD student at Stanford University, is passionate about creating AI systems that address human challenges. He discusses his groundbreaking work on generative agents that mimic believable human behavior, emphasizing the role of context in AI interactions. The conversation delves into the complexities of long-term memory in agents and the significance of knowledge graphs for information retrieval. Joon also challenges traditional views on AI's worldview, exploring how emergent behaviors can reshape human-computer interaction.

May 29, 2023 • 39min

Towards Improved Transfer Learning with Hugo Larochelle - #631

Hugo Larochelle, a research scientist at Google DeepMind, shares his groundbreaking work on transfer learning and neural knowledge mobilization. He dives into the significance of pre-training and fine-tuning in AI models, discussing the challenges and innovations in applying these techniques across diverse fields. Hugo also enlightens listeners on context-aware code generation and the evolution of large language models, revealing how they enhance code completion. Additionally, he sheds light on the creation of the Transactions on Machine Learning Research journal, advocating for more rigorous and open scientific publishing.

May 22, 2023 • 28min

Language Modeling With State Space Models with Dan Fu - #630

Join Dan Fu, a PhD student at Stanford, as he dives into the evolving landscape of language modeling. He discusses the limitations of state space models and explores innovative techniques like Flash Attention, which enhances memory efficiency for processing longer sequences. Dan also shares insights on using synthetic languages to improve models and the quest for alternatives that outperform current attention-based methods. His research promises exciting advancements for the future of AI in understanding language.

May 15, 2023 • 43min

Building Maps and Spatial Awareness in Blind AI Agents with Dhruv Batra - #629

Dhruv Batra, an associate professor at Georgia Tech and research director at Meta's FAIR team, shares groundbreaking insights on blind navigation agents. He discusses the emergence of maps within these agents and the importance of the embodiment hypothesis for true intelligence. The conversation explores the distinctions between cognitive and robotic mapping, innovations in AI's navigational capabilities using multilayer LSTMs, and the crucial role of memory in spatial awareness. Batra emphasizes the need for responsible data usage and the fascinating evolution of AI methodologies in navigation.

May 8, 2023 • 41min

AI Agents and Data Integration with GPT and LLaMa with Jerry Liu - #628

Join Jerry Liu, co-founder and CEO of Llama Index, as he discusses the innovative creation of this platform that links external data with large language models. He shares insights on the challenges of integrating private data, the importance of automation in decision-making, and the evolution of AI agents. Liu also dives into strategies for optimizing complex queries and highlights the transformative potential of AI in processing unstructured data. Get ready to explore how technology can revolutionize data management!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner