
Mixture of Experts
Welcome to Mixture of Experts, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and their impact on business. From breakthrough research to practical applications, each episode offers a balanced blend of expertise and analysis. Explore how AI is reshaping industries, driving efficiency, and unlocking new opportunities for growth. Whether you're a seasoned professional seeking to stay ahead of the curve or an enthusiast curious about the future of technology, Mixture of Experts delivers the perfect mix of insights and practical knowledge. Tune in and stay informed as we navigate the dynamic intersection of AI and business.
Latest episodes

8 snips
Apr 18, 2025 • 42min
o3 and o4-mini, Google Gemini on-prem and NVIDIA’s U.S. chip manufacturing
OpenAI just dropped o3 and o4-mini! In episode 51 of Mixture of Experts host, Tim Hwang is joined by Chris Hay, Vyoma Gajjar and special guest John Willis, Owner of Botchagalupe Technologies. Today, we analyze Sam Altman’s new AI models, o3 and o4-mini. Next, Google announced that by Q3 you can run Gemini on-prem; what does this mean for enterprise AI adoption? Then, John is on the show today to take us through AI evaluation tools and why we need them. Finally, NVIDIA is planning to move AI chip manufacturing to the U.S. Can they pull this off? All that and more on today’s Mixture of Experts. 00:01 – Intro 00:56 – OpenAI o3 and o4 mini 14:57 – Google Gemini on-prem 23:43 – AI evaluation tools 34:59 – NVIDIA's U.S. chip manufacturing The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

10 snips
Apr 11, 2025 • 38min
AI on IBM z17, Meta's Llama 4 and Google Cloud Next 2025
Join IBM trailblazers Hillery Hunter, an IBM Fellow and CTO of IBM Infrastructure, Shobhit Varshney, Head of Data and AI for the Americas, and Kate Soule, Director of Technical Product Management at Granite, as they dive into the launch of IBM z17 with its cutting-edge AI capabilities. Explore the unveiling of Meta's Llama 4, the innovations at Google Cloud Next, and the evolving perceptions of AI from the Pew Research Center. They tackle everything from zero downtime in financial transactions to AI's role in entertainment and industry dynamics.

Apr 4, 2025 • 43min
OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents
This discussion features Aaron Baughman, an esteemed IBM Fellow known for his innovative work in AI, Ash Minhas, a Lead AI advocate focusing on model interpretability, and Chris Hay, the CTO of Customer Transformation with insights into open-source AI. They delve into whether OpenAI will fully transition to open source by 2027. The conversation also highlights Anthropic's advancements in AI interpretability while critiquing Apple's AI development and exploring Amazon's emerging AI strategies. A compelling mix of insights awaits!

9 snips
Mar 28, 2025 • 42min
DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation
In this discussion, Kate Soule, Director at Granite, and Kush Varshney, IBM Fellow in AI Governance, dive into the future of open-source AI models. They talk about the latest DeepSeek-V3-0324 release and its implications for model evaluation beyond traditional benchmarks. The conversation shifts to Google's innovative Gemini Canvas and 2.5 feature, enhancing real-time coding experiences. They also explore the rising trends in AI image generation with OpenAI's GPT-4o, analyzing its cultural impact and ethical considerations.

15 snips
Mar 21, 2025 • 39min
NVIDIA GTC, Baidu reasoning models, and Gemini AI image generation
In this discussion, AI technical solutions architect Vyoma Gajjar sheds light on the latest advancements from NVIDIA GTC, particularly the groundbreaking Groot N1 humanoid robotics model. Principal research scientist Kaoutar El Maghraoui explores Baidu's new AI reasoning models and their controversial closed-source nature. AI researcher Nathalie Baracaldo addresses the reliability of Chain-of-Thought reasoning in AI, highlighting potential biases. The conversation also touches on Google’s Gemini innovations in AI image generation, exploring competition in the evolving tech landscape.

6 snips
Mar 14, 2025 • 50min
Manus, vibe coding, scaling laws and Perplexity’s AI phone
Join Chris Hay, a distinguished engineer and CTO of Customer Transformation; Kaoutar El Maghraoui, principal research scientist at the AI Hardware Center; and Vyoma Gajjar, AI technical solutions architect, as they delve into the exciting world of AI. They discuss Manus AI's potential to disrupt the tech landscape and the peculiar rise of vibe coding. Also on the agenda are new insights into scaling laws that challenge traditional views and the innovative collaboration behind the upcoming AI phone from Perplexity and Deutsche Telekom.

13 snips
Mar 7, 2025 • 45min
Quantum leap, Model Context Protocol, CoreWeave IPO and an AI voice companion
Blake Johnson, a distinguished engineer and quantum engine lead, shares insights on when quantum computing might make its way into consumer devices. Chris Hay, CTO of Customer Transformation, discusses the Model Context Protocol, which streamlines developer workflows. Volkmar Uhlig, AI infrastructure expert, talks about CoreWeave's shift from crypto to AI cloud services and its competitive edge. They also dive into Sesame AI's innovative voice companion, revealing how these technologies are reshaping user interaction with AI.

27 snips
Mar 1, 2025 • 24min
Bonus: OpenAI GPT-4.5: And the future of pre-training is...
In this insightful discussion, Kate Soule, a veteran in AI, and Chris Hay, an experienced AI analyst, dive deep into the unveiling of OpenAI's GPT-4.5. They explore whether pre-training is becoming obsolete, examining the shift toward inference-focused models. Insights on model selection and the balance of cost versus performance are highlighted. Additionally, they tackle the evolving dynamics of AI pricing and the impact of sophisticated tools on user experience. This conversation is a must-listen for anyone interested in the future of AI.

Feb 28, 2025 • 40min
Episode 44: Claude 3.7 Sonnet, BeeAI agents, Granite 3.2, and emergent misalignment
Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on ‘Emergent misalignment.’ All that and more on this week's episode! 00:01 – Intro 00:41 – Claude 3.7 Sonnet 11:58 – BeeAI agents 20:11– Granite 3.2 29:23 – Emergent misalignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

11 snips
Feb 21, 2025 • 46min
Episode 43: Deep Research, OpenAI inference chip, small VLMs, and AI agent job posting
In this discussion, Kate Soule, an AI and vision language models expert from Granite, Volkmar Uhlig, who leads AI infrastructure at a major firm, and Shobhit Varshney, a senior consultant on AI in the Americas, dive into the burgeoning world of deep research in AI. They explore exciting innovations from OpenAI and Google, the prospects of an OpenAI inference chip, and the rise of smaller vision-language models. The conversation wraps up with an intriguing look at a startup's quest for AI agents, pondering their role in the job market of the future.
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.