
MLOps.community
Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)
Latest episodes

14 snips
Mar 8, 2024 • 1h 10min
The Real E2E RAG Stack // Sam Bean, Rewind AI // #217
From discussing the Real E2E RAG Stack to addressing challenges in building RAG applications, the podcast delves into optimizing systems with DSPI and pipeline efficiency. The journey of complexity and optimization, along with emphasizing motivation and simplification in coding, provides valuable insights for AI and machine learning enthusiasts.

10 snips
Mar 5, 2024 • 51min
Managing Data for Effective GenAI Application // Anu Arora and Anass Bensrhir // #215
Explore the impact of GenAI on industries and challenges in scaling, data quality hindrances, and non-value-added tasks. Delve into the evolving role of data engineers, LLM integration, and GenAI tools for automation and data handling. Discuss risks with LLM models in AI applications, emphasizing data privacy, compliance, and decision-making strategies.

Mar 1, 2024 • 1h 15min
Becoming an AI Evangelist // Alex Volkov // #215
AI Evangelist Alex Volkov shares his journey from running AI models to founding an AI startup. Explore topics like multimodal transformer architecture for lucid dreaming, challenges in MLOps with video data, and evaluating AI models with 'vibe checks'. Learn about AI agents, automation in content creation, and fostering community engagement in AI development.

Feb 28, 2024 • 31min
LLM Use Cases in Production // AI in Production Conference // Panel 1
Experts discuss practical applications of Large Language Models in customer service, property management, and paper summarization. They explore AI's impact on product development, the importance of clear communication, and maximizing workflow efficiency for revenue growth.

4 snips
Feb 24, 2024 • 56min
Information Retrieval & Relevance // Daniel Svonava // #214
The podcast with Daniel Svonava discusses the use of vector embeddings in information retrieval, optimizing recommender systems with vector compute, customizing search vectors for relevance, and the efficiency of specialized models. It explores vector databases, deep learning-based retrieval challenges, and the transformative power of vector embeddings in diverse applications.

Feb 21, 2024 • 52min
Evaluating and Integrating ML Models // Morgan McGuire and Anish Shah // #213
Morgan McGuire and Anish Shah discuss the challenges of productionizing large language models, including cost optimization, latency requirements, trust of output, and debugging. They also mention an upcoming AI in Production Conference on February 22 with informative workshops.

Feb 16, 2024 • 1h 6min
Data Governance and AI // Alexandra Diem // #212
Alexandra Diem, Head of Cloud Analytics & MLOps at Gjensidige, discusses challenges of generative AI in sensitive data environments, specialized chatbots, data governance, enabling teams through MVP development, transitioning analysts into data scientists, and the importance of collaboration. Her journey from academia to being a consultant in Norway is also explored.

Feb 13, 2024 • 53min
Ads Ranking Evolution at Pinterest // Aayush Mudgal // #211
Aayush Mudgal, Senior Machine Learning Engineer at Pinterest, discusses the evolution of ads ranking at Pinterest, including transitioning to deep learning-based transformer models. Topics covered include challenges in productionizing large language models, transitioning to deep learning models, incorporating sequential signals, multi-task learning, and transfer learning, scaling machine learning at Pinterest, and the use of transformers in ad rankings and recommendation models.

Feb 9, 2024 • 56min
LLM Evaluation with Arize AI's Aparna Dhinakaran // #210
The podcast discusses the complexities of Language Model evaluation, the use of open-source versus private models, and the urgency of getting models into production. It also explores the challenges of evaluating LLM outcomes and highlights the importance of prompt engineering. Additionally, it emphasizes the need to quickly get ML models into production for identifying bottlenecks and setting up metrics.

10 snips
Feb 6, 2024 • 1h 4min
Powering MLOps: The Story of Tecton's Rift // Matt Bleifer & Mike Eastham // #209
Guests Matt Bleifer and Mike Eastham from Tecton discuss the challenges and use cases of Large Language Models and feature platforms in MLOps. They also introduce Tecton's new product RIFT, highlight the importance of choosing the right tool for the job, and delve into the design decisions and challenges of data processing and aggregation in a managed service.