MLOps.community cover image

MLOps.community

Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)

Latest episodes

Mar 8, 2024 • 1h 10min

The Real E2E RAG Stack // Sam Bean, Rewind AI // #217

From discussing the Real E2E RAG Stack to addressing challenges in building RAG applications, the podcast delves into optimizing systems with DSPI and pipeline efficiency. The journey of complexity and optimization, along with emphasizing motivation and simplification in coding, provides valuable insights for AI and machine learning enthusiasts.

Mar 5, 2024 • 51min

Managing Data for Effective GenAI Application // Anu Arora and Anass Bensrhir // #215

Explore the impact of GenAI on industries and challenges in scaling, data quality hindrances, and non-value-added tasks. Delve into the evolving role of data engineers, LLM integration, and GenAI tools for automation and data handling. Discuss risks with LLM models in AI applications, emphasizing data privacy, compliance, and decision-making strategies.

Mar 1, 2024 • 1h 15min

Becoming an AI Evangelist // Alex Volkov // #215

AI Evangelist Alex Volkov shares his journey from running AI models to founding an AI startup. Explore topics like multimodal transformer architecture for lucid dreaming, challenges in MLOps with video data, and evaluating AI models with 'vibe checks'. Learn about AI agents, automation in content creation, and fostering community engagement in AI development.

Feb 28, 2024 • 31min

LLM Use Cases in Production // AI in Production Conference // Panel 1

Experts discuss practical applications of Large Language Models in customer service, property management, and paper summarization. They explore AI's impact on product development, the importance of clear communication, and maximizing workflow efficiency for revenue growth.

Feb 24, 2024 • 56min

Information Retrieval & Relevance // Daniel Svonava // #214

The podcast with Daniel Svonava discusses the use of vector embeddings in information retrieval, optimizing recommender systems with vector compute, customizing search vectors for relevance, and the efficiency of specialized models. It explores vector databases, deep learning-based retrieval challenges, and the transformative power of vector embeddings in diverse applications.

Feb 21, 2024 • 52min

Evaluating and Integrating ML Models // Morgan McGuire and Anish Shah // #213

Morgan McGuire and Anish Shah discuss the challenges of productionizing large language models, including cost optimization, latency requirements, trust of output, and debugging. They also mention an upcoming AI in Production Conference on February 22 with informative workshops.

Feb 16, 2024 • 1h 6min

Data Governance and AI // Alexandra Diem // #212

Alexandra Diem, Head of Cloud Analytics & MLOps at Gjensidige, discusses challenges of generative AI in sensitive data environments, specialized chatbots, data governance, enabling teams through MVP development, transitioning analysts into data scientists, and the importance of collaboration. Her journey from academia to being a consultant in Norway is also explored.

Feb 13, 2024 • 53min

Ads Ranking Evolution at Pinterest // Aayush Mudgal // #211

Aayush Mudgal, Senior Machine Learning Engineer at Pinterest, discusses the evolution of ads ranking at Pinterest, including transitioning to deep learning-based transformer models. Topics covered include challenges in productionizing large language models, transitioning to deep learning models, incorporating sequential signals, multi-task learning, and transfer learning, scaling machine learning at Pinterest, and the use of transformers in ad rankings and recommendation models.

Feb 9, 2024 • 56min

LLM Evaluation with Arize AI's Aparna Dhinakaran // #210

The podcast discusses the complexities of Language Model evaluation, the use of open-source versus private models, and the urgency of getting models into production. It also explores the challenges of evaluating LLM outcomes and highlights the importance of prompt engineering. Additionally, it emphasizes the need to quickly get ML models into production for identifying bottlenecks and setting up metrics.

Feb 6, 2024 • 1h 4min

Powering MLOps: The Story of Tecton's Rift // Matt Bleifer & Mike Eastham // #209

Guests Matt Bleifer and Mike Eastham from Tecton discuss the challenges and use cases of Large Language Models and feature platforms in MLOps. They also introduce Tecton's new product RIFT, highlight the importance of choosing the right tool for the job, and delve into the design decisions and challenges of data processing and aggregation in a managed service.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner