

MLOps.community
Demetrios
Relaxed conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, vibes, etc.)
Episodes

49 snips
Jun 3, 2025 • 53min
Product Metrics are LLM Evals // Raza Habib CEO of Humanloop // #320
Raza Habib, CEO and Co-founder of Humanloop and a PhD in Machine Learning, shares insights on enhancing AI product accuracy by shortening evaluation feedback loops. He discusses the evolution of evaluation methodologies in AI, the complexities of large language models, and the importance of collaboration in overcoming AI challenges. Raza highlights how integrating user feedback can refine model performance and improve user satisfaction, particularly in customer support and performance management. His ideas on prompt engineering and the emerging role of AI in personalized recommendations are also enlightening.

51 snips
May 30, 2025 • 50min
Getting AI Apps Past the Demo // Vaibhav Gupta // #319
Vaibhav Gupta, CEO of BoundaryML and BAML creator, shares insights from his decade in AI performance optimization at giants like Google and Microsoft. He critiques current prompt engineering, advocating for organized coding practices to enhance AI reliability. The discussion spans the evolution of web development and AI integration challenges, emphasizing the need for programming languages that support large models. Gupta also introduces BAML, a language designed for seamless integration, showcasing its promising applications in sectors like government and healthcare.

12 snips
May 23, 2025 • 48min
Building Out GPU Clouds // Mohan Atreya // #317
Mohan Atreya, Chief Product Officer at Rafay Systems with a rich background at Okta and McAfee, dives into the chaos of GPUs in AI. He discusses the hurdles of GPU scarcity and high prices, as well as dynamic cloud models that adopt tokenized access. The conversation highlights the challenges of crafting GPU cloud infrastructures, power management issues, and how innovative strategies are redefining user experience. Mohan also touches on the shift towards payer-friendly systems that enhance flexibility, paving the way for a more efficient AI landscape.

139 snips
May 21, 2025 • 1h 5min
A Candid Conversation Around MCP and A2A // Rahul Parundekar and Sam Partee // #316 SF Live
Rahul Parundekar, Founder of AI Hero, Inc., and Sam Partee, CTO of Arcade AI, dive into the complexities of AI agents and tools. They tackle the significance of digital permissions and the challenges of agent-to-agent interactions. The duo unveils the intricacies of authentication processes, emphasizing OAuth's role in security. They also discuss the evolution of agent-based programming and AI tools, highlighting the need for improved evaluation methods. With humor, they address the frustrations of automated email responses while celebrating AI's potential to transform workflows.

May 16, 2025 • 56min
AI in M&A: Building, Buying, and the Future of Dealmaking // Kison Patel // #315
Kison Patel, Founder and CEO of DealRoom and M&A Science, shares his insights on the intersection of AI and M&A. He discusses how AI capabilities are transforming deal-making and what makes AI companies attractive targets. Kison emphasizes the importance of a buyer-led M&A strategy and shares strategies for effective team building and communication. He also explores the challenges of pricing AI products and the cautious valuation approaches in the rapidly evolving market. Tune in for a thought-provoking look at the future of deal-making!

May 14, 2025 • 50min
AI, Marketing, and Human Decision Making // Fausto Albers // #313
Fausto Albers, an AI Engineer and community lead at AI Builders Club Amsterdam, dives into the interplay between AI, marketing, and human decision-making. He discusses the exciting potential of generative AI in transforming creative workflows and A/B testing while cautioning against the risks of diminishing critical thinking. The conversation also touches on the importance of human connection in an automated world and how technological integration must remain balanced with personal interactions in various industries, especially hospitality.

30 snips
May 13, 2025 • 53min
MLOps with Databricks // Maria Vechtomova // #314
Maria Vechtomova, an MLOps Tech Lead and co-founder of Marvelous MLOps, shares her insights on the complexities of MLOps and the advantages of using Databricks. She discusses the challenges data scientists face transitioning from notebooks to production-ready models and stresses the importance of model packaging. The conversation also touches on emerging terms like 'LLM Ops,' new features in MLflow, and the practical uses of Databricks for model serving. Plus, she mentions an upcoming hands-on course and a book on Databricks, aimed at enhancing the learning experience.

71 snips
May 6, 2025 • 1h 2min
Making AI Reliable is the Greatest Challenge of the 2020s // Alon Bochman // #312
Alon Bochman, CEO of RagMetrics and AI veteran, dives into the complexities of making AI reliable. He emphasizes empirical evaluation over influencer advice, advocating for collaboration between technical and domain experts. Alon discusses the importance of tailoring AI solutions and involving subject matter experts in development. The conversation also covers fine-tuning language models through expert feedback and the challenges of AI in finance, highlighting the need for effective knowledge-sharing to enhance accuracy in decision-making.

28 snips
May 2, 2025 • 1h 2min
Behavior Modeling, Secondary AI Effects, Bias Reduction & Synthetic Data // Devansh Devansh // #311
In this engaging discussion, Devansh Devansh, an open-source AI researcher and Head of AI at a stealth startup, shares insights on grounded AI research and the biases present in data. He emphasizes the balance between deterministic systems and autonomous agents, urging a rethink of data infrastructures. The conversation delves into the potential of synthetic data for reducing biases and enhancing fairness, encouraging listeners to consider ethical implications. Devansh also highlights the critical role of behavioral modeling in improving user experiences and insights.

17 snips
Apr 29, 2025 • 1h 14min
GraphBI: Expanding Analytics to All Data Through the Combination of GenAI, Graph, & Visual Analytics // Paco Nathan & Weidong Yang // #310
GraphBI: Expanding Analytics to All Data Through the Combination of GenAI, Graph, & Visual Analytics // MLOps Podcast #310 with Paco Nathan, Principal DevRel Engineer at Senzing & Weidong Yang, CEO of Kineviz.

Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter

// Abstract
Existing BI and big data solutions depend largely on structured data, which makes up only about 20% of all available information, leaving the vast majority untapped. In this talk, we introduce GraphBI, which aims to address this challenge by combining GenAI, graph technology, and visual analytics to unlock the full potential of enterprise data.

Recent technologies like RAG (Retrieval-Augmented Generation) and GraphRAG leverage GenAI for tasks such as summarization and Q&A, but they often function as black boxes, making verification challenging. In contrast, GraphBI uses GenAI for data pre-processing (converting unstructured data into a graph-based format), enabling a transparent, step-by-step analytics process that ensures reliability.

We will walk through the GraphBI workflow, exploring best practices and challenges in each step of the process: managing both structured and unstructured data, data pre-processing with GenAI, iterative analytics using a BI-focused graph grammar, and final insight presentation. This approach uniquely surfaces business insights by effectively incorporating all types of data.

// Bio
Paco Nathan
Paco is a "player/coach" who excels in data science, machine learning, and natural language, with 40 years of industry experience. He leads DevRel for the Entity Resolved Knowledge Graph practice area at Senzing.com and advises Argilla.io, Kurve.ai, KungFu.ai, and DataSpartan.co.uk, and is lead committer for the pytextrank and kglab open source projects. Formerly: Director of Learning Group at O'Reilly Media; and Director of Community Evangelism at Databricks.

Weidong Yang
Weidong Yang, Ph.D., is the founder and CEO of Kineviz, a San Francisco-based company that develops interactive visual analytics based solutions to address complex big data problems. His expertise spans Physics, Computer Science and Performing Art, with significant contributions to the semiconductor industry and quantum dot research at UC, Berkeley and Silicon Valley. Yang also leads Kinetech Arts, a 501(c) non-profit blending dance, science, and technology. An eloquent public speaker and performer, he holds 11 US patents, including the groundbreaking Diffraction-based Overlay technology, vital for sub-10-nm semiconductor production.

// Related Links
Website: https://www.kineviz.com/
Blog: https://medium.com/kineviz
Website: https://derwen.ai/paco
https://huggingface.co/pacoid
https://github.com/ceteri
https://neo4j.com/developer-blog/entity-resolved-knowledge-graphs/

~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~
Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore
Join our Slack community [https://go.mlops.community/slack]
Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)
Sign up for the next meetup: [https://go.mlops.community/register]
MLOps Swag/Merch: [https://shop.mlops.community/]
Connect with Demetrios on LinkedIn: /dpbrinkm
Connect with Weidong on LinkedIn: /yangweidong/
Connect with Paco on LinkedIn: /ceteri/