
MLOps.community
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)
Latest episodes

Mar 28, 2025 • 59min
Efficient GPU infrastructure at LinkedIn // Animesh Singh // MLOps Podcast #299
Animesh Singh, Executive Director of AI and ML Platform at LinkedIn, leads the charge in evolving AI technologies. He dives into the transformative impact of large language models on recruitment, highlighting LinkedIn's Hiring Assistant. Animesh also discusses the financial challenges of GPU infrastructure, emphasizing the need for optimization strategies. The conversation touches on real-time training and the intricate balance between scaling AI advancements and managing costs, offering insights into the future of AI and infrastructure innovations.

19 snips
Mar 25, 2025 • 47min
Building Trust Through Technology: Responsible AI in Practice // Allegra Guinan // #298
Allegra Guinan, Co-founder and CTO of Lumiera, dives into the nuances of Responsible AI. She emphasizes the need to integrate responsible practices deeply into organizational culture, rather than merely ticking compliance boxes. The conversation covers how to navigate transparency and explainability challenges, the importance of inclusivity in AI development, and adapting to failures in technology. Allegra also highlights the necessity of balancing innovation with human experiences in a rapidly personalizing world, reaffirming that curiosity and collaboration are key in leadership.

9 snips
Mar 21, 2025 • 47min
Claude Plays Pokémon - A Conversation with the Creator // David Hershey // #294
David Hershey, a Member of Technical Staff at Anthropic, dives into his innovative project where Claude plays Pokémon. He shares insights on crafting AI experiences, the joys of iterative experimentation, and the challenges of managing information overload in gameplay. The discussion covers the nuances of fine-tuning versus prompt engineering and the potential of AI agents in various professional domains. Hershey also explores the unpredictable future of AI, emphasizing its transformative power across industries.

37 snips
Mar 18, 2025 • 1h 5min
From Rules to Reasoning Engines // George Mathew // #296
George Mathew, Managing Director at Insight Partners, is a veteran of venture-stage investing in AI and data companies. He discusses the rapid evolution of AI, spotlighting game-changers like ChatGPT and the shift from rule-based systems to AI-driven reasoning engines. George highlights the significance of high-quality data and innovative models like DeepSeek. He envisions AI fundamentally altering business operations while navigating the balance between creativity and reliability in AI outputs, and shares insights on future integrated AI assistants.

10 snips
Mar 14, 2025 • 1h 6min
GenAI Traffic: Why API Infrastructure Must Evolve... Again // Erica Hughberg // #296
Join Erica Hughberg, Community Advocate at Tetrate, as she dives into the evolution of internet connectivity and its profound impact on AI. The conversation covers the shift from thread-based to event-driven web architectures and the transition from monolithic systems to microservices. Erica highlights how optimizing API requests with Envoy can enhance performance for large language models. She also underscores the importance of community collaboration and proactive solutions in navigating the complexities of evolving AI challenges and infrastructure.

19 snips
Mar 11, 2025 • 54min
The Unbearable Lightness of Data // Rohit Krishnan // #295
Rohit Krishnan, Chief Product Officer at Bodo.AI, shares his expertise on the evolving landscape of AI and its intersection with data engineering. He discusses innovative reasoning models and user interaction's role in enhancing AI outputs. Rohit delves into Bodo.AI’s open-source transition, which aims to revolutionize data analytics, and addresses the challenges of AI's impact on jobs. The conversation also speculates on the future of workload management and the crucial distinctions between AI and ML engineers in today’s tech environment.

12 snips
Mar 7, 2025 • 52min
Kubernetes, AI Gateways, and the Future of MLOps // Alexa Griffith // #294
Alexa Griffith, a Senior Software Engineer at Bloomberg, shares her journey in building scalable ML inference platforms and contributing to open-source projects. She discusses the evolution of workflow tools, comparing Airflow with newer solutions like Kubeflow and Argo. Highlighting the Envoy AI Gateway, she addresses the challenges of managing AI traffic. Alexa also emphasizes the importance of aligning tech work with business goals, optimizing GPU utilization, and fostering effective communication between teams for successful AI deployments.

20 snips
Mar 4, 2025 • 54min
Future of Software, Agents in the Enterprise, and Inception Stage Company Building // Eliot Durbin // #293
In this engaging discussion, Eliot Durbin, General Partner at Boldstart Ventures, shares invaluable insights from his 15 years of investing in inception-stage companies. He dives into the unique qualities that drive successful founders, emphasizing their commitment to innovation over traditional metrics. The conversation explores the rapid evolution of AI and software agents, along with the challenges of integrating AI in enterprises. Eliot also touches on the shifting landscape of venture capital, highlighting the need for adaptability and innovative thinking in today's tech industry.

16 snips
Mar 3, 2025 • 48min
The Agent Exchange: Practitioner Insights
Dmitri Jarnikov, Senior Director of Data Science at Prosus, shares insights on AI and Gen AI products. Chiara Caratelli, a data scientist at Prosus Group, discusses the dynamic blend of generic and specialized agents. Steven Vester, Head of Product at OLX, predicts that trust in specialized agents will grow before generic ones prevail. The trio explores the challenges e-commerce platforms face in integrating AI agents and emphasizes the importance of building trust for successful adoption. Exciting opportunities for new agent-driven business models emerge!

42 snips
Feb 28, 2025 • 54min
Talk to Your Data: The SQL Data Analyst
Paul van der Boor, VP AI at Prosus Group, and Donné Stevenson, Machine Learning Engineer, dive into the fascinating world of the Toqan Data Analyst agent. They discuss the agent’s seamless integration into production, overcoming challenges like LLM overconfidence and query accuracy. The duo explores the importance of modular architecture for enhanced interaction and the hurdles of SQL queries amidst unclear definitions. Their insights into structured reasoning models promise a brighter future for data analysis, making complex tasks more efficient and accurate.