The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Sam Charrington
undefined
92 snips
Feb 18, 2025 • 53min

π0: A Foundation Model for Robotics with Sergey Levine - #719

In this discussion, Sergey Levine, an associate professor at UC Berkeley and co-founder of Physical Intelligence, dives into π0, a groundbreaking general-purpose robotic foundation model. He explains its innovative architecture that combines vision-language models with a novel action expert. The conversation touches on the critical balance of training data, the significance of open-sourcing, and the impressive capabilities of robots like folding laundry effectively. Levine also highlights the exciting future of affordable robotics and the potential for diverse applications.
undefined
310 snips
Feb 10, 2025 • 1h 45min

AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718

Victor Dibia, a Principal Research Software Engineer at Microsoft Research, joins to discuss the future of AI agents and multi-agent systems. He highlights how these systems surpass traditional software with their reasoning and adaptability. The conversation dives into the rise of agentic foundation models, evaluating their performance, and the growing enterprise applications. Victor also shares insights on implementing successful AI architectures, the impact on software engineering, and the importance of human-AI collaboration in navigating these advancements.
undefined
24 snips
Feb 4, 2025 • 1h 17min

Speculative Decoding and Efficient LLM Inference with Chris Lott - #717

In this discussion, Chris Lott, Senior Director of Engineering at Qualcomm AI Research, dives into the complexities of accelerating large language model inference. He details the challenges of encoding and decoding, alongside hardware constraints like memory bandwidth and performance metrics. Lott shares innovative techniques for boosting efficiency, such as KV compression and speculative decoding. He also envisions the future of AI on edge devices, emphasizing the importance of small language models and integrated orchestrators for seamless user experiences.
undefined
71 snips
Jan 28, 2025 • 52min

Ensuring Privacy for Any LLM with Patricia Thaine - #716

Patricia Thaine, co-founder and CEO of Private AI, specializes in privacy-preserving AI techniques. She dives into the critical issues of data minimization, the risks of personal data leakage from large language models (LLMs), and the challenges of redacting sensitive information across different formats. Patricia highlights the limitations of data anonymization, the balance between real and synthetic data for model training, and the evolving landscape of AI regulations like GDPR. She also discusses the ethical considerations surrounding bias in AI and the future of privacy in technology.
undefined
152 snips
Jan 21, 2025 • 58min

AI Engineering Pitfalls with Chip Huyen - #715

In this insightful discussion, Chip Huyen, an independent AI researcher and the author of "AI Engineering," dives into the intricacies of AI engineering versus traditional machine learning. She highlights common pitfalls in AI systems and the critical nature of effective planning. The conversation also touches on AI agents, their limitations, and the significance of rigorous evaluation processes. Additionally, Chip explores the growing trend of open-source models and the exciting potential of synthetic data, along with her predictions for AI advancements by 2025.
undefined
162 snips
Jan 13, 2025 • 58min

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Abhijit Bose, Head of Enterprise AI and ML platforms at Capital One, shared insights into the evolution of their generative AI platform. He discussed the transition to a platform-centric approach in finance and the integration challenges faced by MLOps. Bose delved into optimizing Llama models for improved customer service and the role of Kubernetes in enhancing machine learning workflows. He also highlighted the significance of cloud architecture in AI experimentation and the new skill sets required for thriving in the generative AI landscape.
undefined
159 snips
Dec 16, 2024 • 1h 9min

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

In this discussion, Dan Jeffries, Founder and CEO of Kentauros AI, sheds light on the complexities of developing intelligent agents. He shares his innovative 'big brain, little brain, tool brain' strategy for tackling AI real-world challenges and explores the trade-offs between general-purpose and task-specific models. Dan emphasizes the significance of open source in advancing AI technologies, the role of human involvement in creating robust agents, and the promising yet challenging future of intelligent agents.
undefined
145 snips
Dec 9, 2024 • 57min

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

Byron Cook, VP and distinguished scientist at AWS's Automated Reasoning Group, dives into automated reasoning techniques designed to mitigate hallucinations in LLMs. He discusses the newly announced Automated Reasoning Checks and their mathematical foundations for safeguarding accuracy in generated text. Byron highlights breakthroughs in NP-complete problem-solving, integration with reinforcement learning, and unique applications in security and cryptography. He also shares insights on the future co-evolution of automated reasoning and generative AI, emphasizing collaboration and innovation.
undefined
21 snips
Dec 3, 2024 • 55min

AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711

Arash Behboodi, Director of Engineering at Qualcomm AI Research, discusses what's on the agenda for this year's NeurIPS conference. He highlights the challenges of differentiable simulation, particularly in wireless systems, and dives into how uncertainty quantification can enhance machine learning models through conformal prediction and entropy. Behboodi also previews innovative demos like on-device video editing and 3D content generation, showcasing Qualcomm's commitment to making cutting-edge AI accessible.
undefined
22 snips
Nov 19, 2024 • 54min

AI for Network Management with Shirley Wu - #710

Shirley Wu, Senior Director of Software Engineering at Juniper Networks, leads the discussion on the transformative power of AI and ML in network management. She shares insights on diagnosing cable issues and proactive coverage monitoring, highlighting the benefits of integrating data science. The conversation covers the evolution of roles for network administrators and the innovative Marvis chatbot, which enhances user engagement. Shirley also discusses future directions like proactive testing and the application of smaller ML models to improve efficiency and cost management.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app