Mixture of Experts cover image

Mixture of Experts

Latest episodes

undefined
8 snips
Apr 18, 2025 • 42min

o3 and o4-mini, Google Gemini on-prem and NVIDIA’s U.S. chip manufacturing

OpenAI just dropped o3 and o4-mini! In episode 51 of Mixture of Experts host, Tim Hwang is joined by Chris Hay, Vyoma Gajjar and special guest John Willis, Owner of Botchagalupe Technologies. Today, we analyze Sam Altman’s new AI models, o3 and o4-mini. Next, Google announced that by Q3 you can run Gemini on-prem; what does this mean for enterprise AI adoption? Then, John is on the show today to take us through AI evaluation tools and why we need them. Finally, NVIDIA is planning to move AI chip manufacturing to the U.S. Can they pull this off? All that and more on today’s Mixture of Experts. 00:01 – Intro 00:56 – OpenAI o3 and o4 mini 14:57 – Google Gemini on-prem 23:43 – AI evaluation tools 34:59 – NVIDIA's U.S. chip manufacturing   The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
undefined
10 snips
Apr 11, 2025 • 38min

AI on IBM z17, Meta's Llama 4 and Google Cloud Next 2025

Join IBM trailblazers Hillery Hunter, an IBM Fellow and CTO of IBM Infrastructure, Shobhit Varshney, Head of Data and AI for the Americas, and Kate Soule, Director of Technical Product Management at Granite, as they dive into the launch of IBM z17 with its cutting-edge AI capabilities. Explore the unveiling of Meta's Llama 4, the innovations at Google Cloud Next, and the evolving perceptions of AI from the Pew Research Center. They tackle everything from zero downtime in financial transactions to AI's role in entertainment and industry dynamics.
undefined
Apr 4, 2025 • 43min

OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents

This discussion features Aaron Baughman, an esteemed IBM Fellow known for his innovative work in AI, Ash Minhas, a Lead AI advocate focusing on model interpretability, and Chris Hay, the CTO of Customer Transformation with insights into open-source AI. They delve into whether OpenAI will fully transition to open source by 2027. The conversation also highlights Anthropic's advancements in AI interpretability while critiquing Apple's AI development and exploring Amazon's emerging AI strategies. A compelling mix of insights awaits!
undefined
9 snips
Mar 28, 2025 • 42min

DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation

In this discussion, Kate Soule, Director at Granite, and Kush Varshney, IBM Fellow in AI Governance, dive into the future of open-source AI models. They talk about the latest DeepSeek-V3-0324 release and its implications for model evaluation beyond traditional benchmarks. The conversation shifts to Google's innovative Gemini Canvas and 2.5 feature, enhancing real-time coding experiences. They also explore the rising trends in AI image generation with OpenAI's GPT-4o, analyzing its cultural impact and ethical considerations.
undefined
15 snips
Mar 21, 2025 • 39min

NVIDIA GTC, Baidu reasoning models, and Gemini AI image generation

In this discussion, AI technical solutions architect Vyoma Gajjar sheds light on the latest advancements from NVIDIA GTC, particularly the groundbreaking Groot N1 humanoid robotics model. Principal research scientist Kaoutar El Maghraoui explores Baidu's new AI reasoning models and their controversial closed-source nature. AI researcher Nathalie Baracaldo addresses the reliability of Chain-of-Thought reasoning in AI, highlighting potential biases. The conversation also touches on Google’s Gemini innovations in AI image generation, exploring competition in the evolving tech landscape.
undefined
6 snips
Mar 14, 2025 • 50min

Manus, vibe coding, scaling laws and Perplexity’s AI phone

Join Chris Hay, a distinguished engineer and CTO of Customer Transformation; Kaoutar El Maghraoui, principal research scientist at the AI Hardware Center; and Vyoma Gajjar, AI technical solutions architect, as they delve into the exciting world of AI. They discuss Manus AI's potential to disrupt the tech landscape and the peculiar rise of vibe coding. Also on the agenda are new insights into scaling laws that challenge traditional views and the innovative collaboration behind the upcoming AI phone from Perplexity and Deutsche Telekom.
undefined
13 snips
Mar 7, 2025 • 45min

Quantum leap, Model Context Protocol, CoreWeave IPO and an AI voice companion

Blake Johnson, a distinguished engineer and quantum engine lead, shares insights on when quantum computing might make its way into consumer devices. Chris Hay, CTO of Customer Transformation, discusses the Model Context Protocol, which streamlines developer workflows. Volkmar Uhlig, AI infrastructure expert, talks about CoreWeave's shift from crypto to AI cloud services and its competitive edge. They also dive into Sesame AI's innovative voice companion, revealing how these technologies are reshaping user interaction with AI.
undefined
27 snips
Mar 1, 2025 • 24min

Bonus: OpenAI GPT-4.5: And the future of pre-training is...

In this insightful discussion, Kate Soule, a veteran in AI, and Chris Hay, an experienced AI analyst, dive deep into the unveiling of OpenAI's GPT-4.5. They explore whether pre-training is becoming obsolete, examining the shift toward inference-focused models. Insights on model selection and the balance of cost versus performance are highlighted. Additionally, they tackle the evolving dynamics of AI pricing and the impact of sophisticated tools on user experience. This conversation is a must-listen for anyone interested in the future of AI.
undefined
Feb 28, 2025 • 40min

Episode 44: Claude 3.7 Sonnet, BeeAI agents, Granite 3.2, and emergent misalignment

Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with  new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on ‘Emergent misalignment.’ All that and more on this week's episode! 00:01 – Intro  00:41 – Claude 3.7 Sonnet 11:58 – BeeAI agents  20:11– Granite 3.2 29:23 – Emergent misalignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 
undefined
11 snips
Feb 21, 2025 • 46min

Episode 43: Deep Research, OpenAI inference chip, small VLMs, and AI agent job posting

In this discussion, Kate Soule, an AI and vision language models expert from Granite, Volkmar Uhlig, who leads AI infrastructure at a major firm, and Shobhit Varshney, a senior consultant on AI in the Americas, dive into the burgeoning world of deep research in AI. They explore exciting innovations from OpenAI and Google, the prospects of an OpenAI inference chip, and the rise of smaller vision-language models. The conversation wraps up with an intriguing look at a startup's quest for AI agents, pondering their role in the job market of the future.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner