

MLOps.community
Demetrios
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)

Apr 4, 2025 • 49min
Streaming Ecosystem Complexities and Cost Management // Rohit Agrawal // #302
Rohit Agrawal, Director of Engineering at Tecton, shares his expertise in streaming data management and ML challenges. He discusses the complexity of navigating both real-time and batch data systems. Rohit highlights the financial implications of tool fragmentation, the evolution of managed services, and the importance of collaboration among data teams. He also explores the emerging trend of Bring Your Own Cloud solutions for enhanced data security. Lastly, he touches on simplifying data processing paradigms and the future of data storage technologies.

Apr 1, 2025 • 41min
Fraud Detection in the AI Era // Rafael Sandroni // #301
Rafael Sandroni, Founder and CEO of GardionAI, dives into the realm of AI security and fraud detection. He discusses the importance of establishing robust security guardrails to combat vulnerabilities like prompt injection attacks. The conversation highlights the zero trust framework as essential for safeguarding AI systems, particularly in the financial sector. Sandroni also contrasts traditional fraud detection with modern AI methods, shedding light on the evolving landscape of cybersecurity and the need for proactive measures.

Mar 30, 2025 • 55min
Beyond the Matrix: AI and the Future of Human Creativity
Fausto Albers, co-founder of the AI Builders Community, dives into the fascinating interplay between AI and human creativity. He discusses how AI can transform job interviews and enhance personal assistants, emphasizing context-aware systems. Albers shares insights on cognitive load reduction through intelligent suggestions, the balance between chaos and structure in innovation, and the challenges of understanding user context in AI interactions. His insights highlight the potential for AI to optimize decision-making and foster collaboration.

Mar 28, 2025 • 59min
Efficient GPU infrastructure at LinkedIn // Animesh Singh // MLOps Podcast #299
Animesh Singh, Executive Director of AI and ML Platform at LinkedIn, leads the charge in evolving AI technologies. He dives into the transformative impact of large language models on recruitment, highlighting LinkedIn's Hiring Assistant. Animesh also discusses the financial challenges of GPU infrastructure, emphasizing the need for optimization strategies. The conversation touches on real-time training and the intricate balance between scaling AI advancements and managing costs, offering insights into the future of AI and infrastructure innovations.

Mar 25, 2025 • 47min
Building Trust Through Technology: Responsible AI in Practice // Allegra Guinan // #298
Allegra Guinan, Co-founder and CTO of Lumiera, dives into the nuances of Responsible AI. She emphasizes the need to integrate responsible practices deeply into organizational culture, rather than merely ticking compliance boxes. The conversation covers how to navigate transparency and explainability challenges, the importance of inclusivity in AI development, and adapting to failures in technology. Allegra also highlights the necessity of balancing innovation with human experiences in a rapidly personalizing world, reaffirming that curiosity and collaboration are key in leadership.

Mar 21, 2025 • 47min
Claude Plays Pokémon - A Conversation with the Creator // David Hershey // #294
David Hershey, a Member of Technical Staff at Anthropic, dives into his innovative project where Claude plays Pokémon. He shares insights on crafting AI experiences, the joys of iterative experimentation, and the challenges of managing information overload in gameplay. The discussion covers the nuances of fine-tuning versus prompt engineering and the potential of AI agents in various professional domains. Hershey also explores the unpredictable future of AI, emphasizing its transformative power across industries.

Mar 18, 2025 • 1h 5min
From Rules to Reasoning Engines // George Mathew // #296
George Mathew, Managing Director at Insight Partners, is a veteran of venture-stage investing in AI and data companies. He discusses the rapid evolution of AI, spotlighting game-changers like ChatGPT and the shift from rule-based systems to AI-driven reasoning engines. George highlights the significance of high-quality data and innovative models like DeepSeek. He envisions AI fundamentally altering business operations while navigating the balance between creativity and reliability in AI outputs, and shares insights on future integrated AI assistants.

Mar 14, 2025 • 1h 6min
GenAI Traffic: Why API Infrastructure Must Evolve... Again // Erica Hughberg // #296
Join Erica Hughberg, Community Advocate at Tetrate, as she dives into the evolution of internet connectivity and its profound impact on AI. The conversation covers the shift from thread-based to event-driven web architectures and the transition from monolithic systems to microservices. Erica highlights how optimizing API requests with Envoy can enhance performance for large language models. She also underscores the importance of community collaboration and proactive solutions in navigating the complexities of evolving AI challenges and infrastructure.

Mar 11, 2025 • 54min
The Unbearable Lightness of Data // Rohit Krishnan // #295
Rohit Krishnan, Chief Product Officer at Bodo.AI, shares his expertise on the evolving landscape of AI and its intersection with data engineering. He discusses innovative reasoning models and user interaction's role in enhancing AI outputs. Rohit delves into Bodo.AI’s open-source transition, which aims to revolutionize data analytics, and addresses the challenges of AI's impact on jobs. The conversation also speculates on the future of workload management and the crucial distinctions between AI and ML engineers in today’s tech environment.

Mar 7, 2025 • 52min
Kubernetes, AI Gateways, and the Future of MLOps // Alexa Griffith // #294
Alexa Griffith, a Senior Software Engineer at Bloomberg, shares her journey in building scalable ML inference platforms and contributing to open-source projects. She discusses the evolution of workflow tools, comparing Airflow with newer solutions like Kubeflow and Argo. Highlighting the Envoy AI Gateway, she addresses the challenges of managing AI traffic. Alexa also emphasizes the importance of aligning tech work with business goals, optimizing GPU utilization, and fostering effective communication between teams for successful AI deployments.