

MLOps.community
Demetrios
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)
Episodes
Mentioned books

14 snips
Nov 18, 2025 • 38min
Reliable Voice Agents
Brooke Hopkins, CEO of Coval and former Waymo lead, sheds light on the evolution of voice AI, highlighting its transition from niche to mainstream. She discusses the vital role of reliability in voice agents, emphasizing strategies like redundancy and latency monitoring to enhance user experience. They delve into practical applications such as customer support and healthcare, while also exploring innovative techniques for context retention and dynamic adjustments during conversations. The conversation also tackles the future of voice autonomy and its realistic timelines.

75 snips
Nov 14, 2025 • 41min
The Future of AI Operations: Insights from PwC AI Managed Services
Rani Radhakrishnan, a Principal at PwC, specializes in AI-managed services and data-driven transformation. She dives into how organizations are shifting from experimentation to realizing ROI with AI solutions. Topics include the need for process standardization, the role of data quality for AI agents, and the importance of human oversight in AI deployment. Rani also contrasts traditional managed services with AI-driven operations, emphasizing continuous optimization and the evolving skill sets required in today's tech landscape.

42 snips
Nov 11, 2025 • 1h 34min
The GPU Uptime Battle
In this engaging discussion, Andy Pernsteiner, Field CTO at VAST Data, dives into the complexities of building robust AI infrastructures. He highlights the critical gap between prototypes and production systems, emphasizing the importance of unified data and real-time processing. Andy reveals how GPU downtime can escalate costs dramatically and advocates for chaos engineering to ensure reliability. He also shares insights on workflow automation, the need for empathy between tech teams, and the advantages of separating logic from data for scalability. This conversation is a must-listen for anyone in the AI space!

36 snips
Nov 4, 2025 • 35min
The Evolution of AI in Cyber Security // Jeff Schwartzentruber // #344
Jeff Schwartzentruber, a Senior Machine Learning Scientist at eSentire, dives into the evolving landscape of AI in cybersecurity. He reveals the shift from signature-based detection to dynamic anomaly detection, tackling issues like alert fatigue in Security Operations Centers. The conversation explores the risks associated with AI agents, including prompt injections and the need for visibility. Jeff highlights how defenders and attackers use Generative AI, emphasizing the importance of maintaining organizational truth amid rising deception risks.

93 snips
Nov 3, 2025 • 38min
Thousands of Fine-Tuned Models
Jaipal Singh Goud, a Solutions Architect at Prem AI, dives into the exciting world of fine-tuning small language models for personalized AI agents. He discusses the contrast between general LLMs and company-specific models, addressing privacy and data control concerns. Jaipal also explores the complementary roles of fine-tuning and RAG systems in query improvement. He emphasizes the importance of user observation for fine-tuning decision-making patterns and envisions a future with countless personalized models, dynamically chosen for each task.

94 snips
Oct 24, 2025 • 51min
The Semantic Layer and AI Agents // David Jayatillake // #343
David Jayatillake, an experienced AI leader and former VP at Cube.dev, delves into the intricacies of semantic layers and their crucial role in data management. He critiques proprietary BI tools for locking companies into confusing ecosystems, advocating for open-source solutions. The discussion extends to how AI agents can streamline data workflows by automating repetitive tasks and enhancing queryability. Jayatillake also highlights the potential of LLMs in building semantic layers and the significance of company-specific definitions for effective data analysis.

127 snips
Oct 21, 2025 • 50min
Building Claude Code: Origin, Story, Product Iterations, & What's Next // Siddharth Bidasaria // #342
Siddharth Bidasaria, a key member of the Claude Code team at Anthropic, shares insights into the innovative coding product's journey. He reveals how Claude Code evolved from a terminal prototype, attracting immediate internal interest. The conversation highlights user-driven improvements like local file tools that enhanced workflow, and the importance of test-driven development for reliable AI code. Siddharth also discusses the balance between model steerability and user friction, plus exciting future possibilities with sub-agents and customizable permissions.

70 snips
Oct 14, 2025 • 51min
Building an Agentic AI Memory Framework
Biswaroop Bhattacharjee, a Senior ML Engineer at Prem AI, dives into the fascinating world of AI memory systems. He discusses Cortex, an innovative framework inspired by human cognition, highlighting how it manages long-term and multimodal memories. The conversation challenges the boundaries of agentic memory, weighing the necessity of forgetting and the implications of memory consolidation. Biswaroop also shares insights into hierarchical collections, retrieval techniques, and the pursuit of integrating vision and audio for a richer AI memory experience.

53 snips
Oct 7, 2025 • 51min
LLMs at Scale: Infrastructure That Keeps AI Safe, Smart & Affordable // Marco Palladino// # 341
Marco Palladino, CTO and co-founder of Kong, dives into the complexities of AI infrastructure. He discusses the importance of building AI gateways to enforce governance and security as technology evolves. The conversation touches on the role of agentic workloads and the challenges of MCP servers. Marco speculates on how agents could transform user interactions and even SEO dynamics. He also highlights real-world applications across industries and shares insights on product development strategies. Prepare for an enlightening exploration of AI's future!

24 snips
Oct 3, 2025 • 9min
Best AI Hackathon Project Ever? [Bite Size Episode]
A winning team at the hackathon reveals their groundbreaking AI travel agent that manages group trips from start to finish. They share insights on overcoming design challenges and integrating multiple agents. The conversation delves into secure, seamless payment systems without human intervention. User experience is highlighted with interactions via calls and WhatsApp. The use of automation for bookings is impressively detailed, showcasing their rapid traction and team's collaborative spirit. Join them as they invite fellow developers to innovate with Unicorn Mafia!


