MLOps.community

Demetrios

Relaxed conversations about getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, vibes, etc.)

Episodes

Jan 2, 2026 • 45min

Computers that Think and Take Actions for You

Zengyi Qin, founder of the OpenAGI Foundation, discusses groundbreaking technology enabling computers to think and act autonomously. He details innovative training methods using large-scale sandboxes and real-world applications, highlighting Lux, a model that outperforms competitors like Gemini. Zengyi explains the agent framework for efficient task management and envisions AI replacing traditional input methods in the near future. He emphasizes the importance of continuous model improvement to tackle performance challenges, paving the way for a new era of human-computer interaction.

Dec 28, 2025 • 29min

Real time features, AI search, Agentic similarities

Varant Zanoyan, Co-founder and CEO of Zipline AI, and Nikhil Simha Raprolu, Co-founder and CTO at Zipline AI, delve into the evolution of AI infrastructure. They share insights on the compute-first approach of Chronon, which emerged from Airbnb, emphasizing real-time features over traditional storage models. The duo explains the complexities of orchestrating signals and pipelines, the challenges of point-in-time correctness, and the importance of governance. They also discuss how Chronon integrates embeddings and real-time workflows, reflecting on its open-sourcing journey with Stripe.

Dec 23, 2025 • 58min

Tool definitions are the new Prompt Engineering

In this discussion, guests Chiara Caratelli and Alex Salazar dive into the intricacies of AI agent tooling. Chiara shares insights from her work at Prosus, focusing on UX and the challenges of clarity in tool definitions, highlighting how ambiguity can lead to edge cases. Alex contrasts traditional APIs with dynamic tools that capture agent intentions to minimize latency. Together, they explore the balance of tool usage, governance, and the future of multi-agent systems, advocating for a structured approach to ensure efficient production-ready AI agents.

Dec 19, 2025 • 58min

The Future of AI Agents is Sandboxed

Jonathan Wall, CEO of Runloop.ai and a former Google engineer, dives into the future of AI agents and the importance of sandboxed environments. He reveals how sandboxes create safe spaces for agents to operate while preventing security risks. Wall discusses building efficient agent infrastructures, the advantages of isolated compute environments, and the significance of creating Git-like workflows for enterprises. He also explores how refining agent performance through failed runs can drive iterative advancements, revolutionizing how agents interact with data and each other.

Dec 16, 2025 • 46min

Context engineering 2.0, Agents + Structured Data, and the Redis Context Engine

Simba Khadder, founder of Featureform and now at Redis, dives into the fascinating world of context engineering for AI. He argues that context, not models, is the real bottleneck for agents. Simba discusses the evolution of feature stores, emphasizing their ongoing value amidst changing ML economics. He introduces a GraphQL-style semantic layer for better data navigation and details how Redis powers these systems with robust capabilities. Plus, he shares insights on how to improve agent functionality by enhancing context access.

Dec 12, 2025 • 1h 2min

Does AgenticRAG Really Work?

In this engaging discussion, Satish Bhambri, a Senior Data Scientist at Walmart Labs, dives deep into the evolution of machine learning, exploring the transition from RNNs to transformers. He sheds light on the emergence of RAG systems and their ability to ground large language models, tackling issues like hallucinations. Satish explains the benefits of agentic RAG for creating specialized agents and the trade-offs between APIs and agents. He also shares insights on vector database selection and the importance of data freshness in recommendation systems.

Dec 10, 2025 • 1h 4min

How Sierra AI Does Context Engineering

Zack Reneau-Wedeen, Head of Product at Sierra, shares insights on revolutionizing AI with context engineering, prioritizing real-world testing over traditional methods. He reveals how AI often feels like a moody coworker and discusses the importance of robust simulations to enhance reliability. Zack advocates for abandoning decision trees in favor of goal-oriented frameworks and explains how Sierra trains graduates to be product-engineering hybrids. He also emphasizes the significance of customer focus to improve AI agents and discusses innovative strategies for scaling and fine-tuning voice interactions.

Dec 5, 2025 • 54min

Overcoming Challenges in AI Agent Deployment: The Sweet Spot for Governance and Security // Spencer Reagan // #349

Spencer Reagan, R&D lead at Airia, specializes in AI-agent orchestration and data governance for regulated environments. He dives into the complexities of agent deployment, discussing how messy data impacts AI performance and why many AI platforms struggle to scale. Reagan offers insights on monitoring agents' actions, building trust through frequent oversight, and automating work in marketing and HR. He highlights the importance of dynamic rules and identity management for secure agent operations, sharing practical analogies to improve agent design.

Dec 2, 2025 • 29min

Hardening Agents for E-commerce Scale: From RL Alignment to Reliability // Panel 2

In this engaging discussion, expert panelists share insights into the world of e-commerce agents. Arushi Jain, a Microsoft applied scientist, delves into post-training techniques that enhance AI reliability for tasks. Swati Bhatia from Google Cloud talks about using Direct Preference Optimization to fine-tune support routing. Audi Liu from Inworld AI discusses architectural trade-offs in voice models for better accuracy. Isabella Piratininga of iFood highlights personalization challenges in Brazil. Together, they explore the complexities of automating customer interactions and the future of AI.

Nov 27, 2025 • 27min

Building Cursor: A Fireside Chat with VP Solutions Ricky Doar

Ricky Doar, VP of Solutions at Cursor, brings a wealth of experience from leading AI developer tool implementations. He discusses AI as a learnable engineering skill and emphasizes the importance of understanding model capabilities. Ricky warns against over-reliance on AI for strategic decisions and highlights best practices for working with existing codebases. He also shares insights on managing context windows and when to trust AI suggestions, ultimately guiding engineers to make informed decisions while harnessing AI's potential.
