
MLOps.community
Relaxed conversations about getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, vibes, etc.)
Latest episodes

Jun 27, 2025 • 1h 37min
AI Reliability, Spark, Observability, SLAs and Starting an AI Infra Company
Kostas Pardalis and Yoni Michael, co-founders of Typedef, dive into the transformative power of AI and data in today’s business landscape. They discuss how LLMs are revolutionizing data pipelines and the vital role of AI Reliability Engineers in ensuring system stability. The duo emphasizes the need for robust infrastructure and collaboration between engineers and domain experts to tackle integration challenges. They also explore the evolving dynamics of AI in customer support and the importance of community in driving innovation in technology.

11 snips
Jun 24, 2025 • 49min
Greg Kamradt: Benchmarking Intelligence | ARC Prize
Greg Kamradt, a mentor for AI-centric developers and an expert in launching tech products, dives into the nuances of AI benchmarking. He discusses the challenges in creating effective benchmarks, highlighting the intriguing concept of puzzles that are easy for humans but hard for AI. The conversation covers compute tradeoffs and the philosophical implications of tracking AI progress towards AGI. Greg also shares insights on motivating participants in AI competitions and the evolving ARC framework for assessing intelligence in innovative ways.

30 snips
Jun 20, 2025 • 57min
Bridging the Gap Between AI and Business Data // Deepti Srivastava // #325
Deepti Srivastava, Founder and CEO of Snow Leopard AI, with nearly 20 years in data platforms, shares insights on making AI work for businesses. She discusses the challenges of integrating operational data with AI, emphasizing the need for effective connections to structured data. Deepti advocates for simplifying data pipeline management to enhance real-time analytics. She also highlights innovative tools like Snow Leopard that unlock data accessibility, illustrating the importance of embedding AI into a company’s tech stack for improved decision-making.

15 snips
Jun 17, 2025 • 1h 10min
The Creator of FastAPI’s Next Chapter // Sebastián Ramírez // #324
Sebastián Ramírez, known as Tiangolo, is the creator of FastAPI and currently leads FastAPI Labs. He dives into his journey launching FastAPI Cloud and shares humorous tales from a recent hackathon. Sebastián discusses the transformative role of Pydantic in streamlining APIs and reflects on FastAPI's impressive growth and adoption, even by NASA. He addresses challenges in software design and the balance of open-source sustainability, emphasizing a user-centric philosophy that enhances the developer experience.

129 snips
Jun 13, 2025 • 47min
Everything Hard About Building AI Agents Today
Join Willem Pienaar, CTO of Cleric and creator of Feast, along with PhD student Shreya Shankar, as they tackle the toughest challenges in building AI agents. They discuss the ambiguity of 'ground truth' in evaluations, revealing three key gulfs of human-AI interaction that hinder success. The duo emphasizes the importance of moving humans out of the feedback loop, using implicit signals for faster learning. Practical techniques like heat maps for task failures and the complexities of simulated environments are also explored, shedding light on the inevitable performance ceiling of AI.

Jun 11, 2025 • 54min
Tricks to Fine Tuning // Prithviraj Ammanabrolu // #318
Prithviraj Ammanabrolu, an Assistant Professor at UC San Diego and Research Scientist at Databricks, dives deep into the TAO fine-tuning method. This technique trains models without labeled data, using reinforcement learning and synthetic inputs. The conversation explores how TAO can enhance small models, make the most of limited datasets, and fine-tune outputs effectively. Prithviraj highlights strategies to balance performance, adaptability, and efficiency in machine learning, positioning these advancements as game-changers for model training.

22 snips
Jun 10, 2025 • 56min
Packaging MLOps Tech Neatly for Engineers and Non-engineers // Jukka Remes // #322
Jukka Remes, a Senior Lecturer and AI Architect, shares insights from his extensive experience in MLOps and AI enablement. He discusses the creation of an open-source MLOps platform designed for flexibility across environments, emphasizing the importance of user-friendly tools for both engineers and non-engineers. Jukka also addresses the challenges of transitioning models from research to production, highlights the need for compliance with evolving regulations, and advocates for collaboration to bridge gaps between technical teams and stakeholders.

41 snips
Jun 6, 2025 • 49min
Hard Learned Lessons from Over a Decade in AI
Mike Del Balso, CEO and co-founder of Tecton, shares his decade-long journey in AI innovation, including the creation of Uber's Michelangelo ML platform. He discusses the evolution of predictive machine learning use cases, emphasizing the significance of feature stores. Del Balso explores the challenges of real-time data utilization, fraud detection, and the importance of model maturity in business impact. He also highlights the relevance of integrating generative AI with ML for enhanced marketing efficiency, turning complex data into smarter decisions.

49 snips
Jun 3, 2025 • 53min
Product Metrics are LLM Evals // Raza Habib CEO of Humanloop // #320
Raza Habib, CEO and co-founder of Humanloop, who holds a PhD in machine learning, shares insights on improving AI product accuracy by shortening evaluation feedback loops. He discusses the evolution of evaluation methodologies in AI, the complexities of large language models, and the importance of collaboration in overcoming AI challenges. Raza highlights how integrating user feedback can refine model performance and improve user satisfaction, particularly in customer support and performance management. He also shares ideas on prompt engineering and the emerging role of AI in personalized recommendations.

51 snips
May 30, 2025 • 50min
Getting AI Apps Past the Demo // Vaibhav Gupta // #319
Vaibhav Gupta, CEO of BoundaryML and BAML creator, shares insights from his decade in AI performance optimization at giants like Google and Microsoft. He critiques current prompt engineering, advocating for organized coding practices to enhance AI reliability. The discussion spans the evolution of web development and AI integration challenges, emphasizing the need for programming languages that support large models. Gupta also introduces BAML, a language designed for seamless integration, showcasing its promising applications in sectors like government and healthcare.