The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Latest episodes

undefined
75 snips
Mar 3, 2025 • 49min

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

Niklas Muennighoff, a PhD student at Stanford, dives into his groundbreaking work on the S1 reasoning model, designed to efficiently mimic OpenAI's O1 while costing under $50 to train. He elaborates on innovative techniques like 'budget forcing' that help the model tackle complex problems more effectively. The discussion highlights the intricacies of test-time scaling, the importance of data curation, and the differences between supervised fine-tuning and reinforcement learning. Niklas also shares insights on the future of open-sourced AI models.
undefined
29 snips
Feb 24, 2025 • 1h 7min

Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720

Ron Diamant, Chief Architect for Trainium at AWS, delves into the revolutionary Trainium2 chip designed for AI and ML acceleration. He discusses its unique systolic array architecture and how it outperforms traditional GPUs in key performance dimensions. The conversation highlights the ecosystem surrounding Trainium, including the Neuron SDK and its various provisioning options. Diamant also touches upon customer adoption, performance benchmarks, and future prospects for Trainium, showcasing its pivotal role in shaping AI training and inference.
undefined
57 snips
Feb 18, 2025 • 53min

π0: A Foundation Model for Robotics with Sergey Levine - #719

In this discussion, Sergey Levine, an associate professor at UC Berkeley and co-founder of Physical Intelligence, dives into π0, a groundbreaking general-purpose robotic foundation model. He explains its innovative architecture that combines vision-language models with a novel action expert. The conversation touches on the critical balance of training data, the significance of open-sourcing, and the impressive capabilities of robots like folding laundry effectively. Levine also highlights the exciting future of affordable robotics and the potential for diverse applications.
undefined
221 snips
Feb 10, 2025 • 1h 45min

AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718

Victor Dibia, a Principal Research Software Engineer at Microsoft Research, joins to discuss the future of AI agents and multi-agent systems. He highlights how these systems surpass traditional software with their reasoning and adaptability. The conversation dives into the rise of agentic foundation models, evaluating their performance, and the growing enterprise applications. Victor also shares insights on implementing successful AI architectures, the impact on software engineering, and the importance of human-AI collaboration in navigating these advancements.
undefined
24 snips
Feb 4, 2025 • 1h 17min

Speculative Decoding and Efficient LLM Inference with Chris Lott - #717

In this discussion, Chris Lott, Senior Director of Engineering at Qualcomm AI Research, dives into the complexities of accelerating large language model inference. He details the challenges of encoding and decoding, alongside hardware constraints like memory bandwidth and performance metrics. Lott shares innovative techniques for boosting efficiency, such as KV compression and speculative decoding. He also envisions the future of AI on edge devices, emphasizing the importance of small language models and integrated orchestrators for seamless user experiences.
undefined
59 snips
Jan 28, 2025 • 52min

Ensuring Privacy for Any LLM with Patricia Thaine - #716

Patricia Thaine, co-founder and CEO of Private AI, specializes in privacy-preserving AI techniques. She dives into the critical issues of data minimization, the risks of personal data leakage from large language models (LLMs), and the challenges of redacting sensitive information across different formats. Patricia highlights the limitations of data anonymization, the balance between real and synthetic data for model training, and the evolving landscape of AI regulations like GDPR. She also discusses the ethical considerations surrounding bias in AI and the future of privacy in technology.
undefined
116 snips
Jan 21, 2025 • 58min

AI Engineering Pitfalls with Chip Huyen - #715

In this insightful discussion, Chip Huyen, an independent AI researcher and the author of "AI Engineering," dives into the intricacies of AI engineering versus traditional machine learning. She highlights common pitfalls in AI systems and the critical nature of effective planning. The conversation also touches on AI agents, their limitations, and the significance of rigorous evaluation processes. Additionally, Chip explores the growing trend of open-source models and the exciting potential of synthetic data, along with her predictions for AI advancements by 2025.
undefined
145 snips
Jan 13, 2025 • 58min

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Abhijit Bose, Head of Enterprise AI and ML platforms at Capital One, shared insights into the evolution of their generative AI platform. He discussed the transition to a platform-centric approach in finance and the integration challenges faced by MLOps. Bose delved into optimizing Llama models for improved customer service and the role of Kubernetes in enhancing machine learning workflows. He also highlighted the significance of cloud architecture in AI experimentation and the new skill sets required for thriving in the generative AI landscape.
undefined
155 snips
Dec 16, 2024 • 1h 9min

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

In this discussion, Dan Jeffries, Founder and CEO of Kentauros AI, sheds light on the complexities of developing intelligent agents. He shares his innovative 'big brain, little brain, tool brain' strategy for tackling AI real-world challenges and explores the trade-offs between general-purpose and task-specific models. Dan emphasizes the significance of open source in advancing AI technologies, the role of human involvement in creating robust agents, and the promising yet challenging future of intelligent agents.
undefined
145 snips
Dec 9, 2024 • 57min

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

Byron Cook, VP and distinguished scientist at AWS's Automated Reasoning Group, dives into automated reasoning techniques designed to mitigate hallucinations in LLMs. He discusses the newly announced Automated Reasoning Checks and their mathematical foundations for safeguarding accuracy in generated text. Byron highlights breakthroughs in NP-complete problem-solving, integration with reinforcement learning, and unique applications in security and cryptography. He also shares insights on the future co-evolution of automated reasoning and generative AI, emphasizing collaboration and innovation.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode