MLOps.community

Demetrios
undefined
Mar 26, 2024 • 1h 4min

4 Years of the MLOps Community // Demetrios Brinkmann // #220

Demetrios Brinkmann, Founder of the MLOps Community, discusses the origin, structure, and challenges of the community. They talk about job dynamics, sustained relationships, hosting events, and transitioning to sponsorship-based. Demetrios reflects on his journey to Germany and envisions a global hub for AI learning.
undefined
7 snips
Mar 22, 2024 • 1h 15min

The Art and Science of Training LLMs // Bandish Shah and Davis Blalock // #219

Exploring the challenges of training large language models, including debugging issues and evaluating machine learning models effectively. The discussion covers the importance of data quality, efficient computation techniques, and optimizing machine learning model training and deployment for successful outcomes.
undefined
Mar 19, 2024 • 35min

Security and Privacy // Day 2 Panel 1 // AI in Production Conference

Experts discuss the risks and evolving security landscape of AI, emphasizing education in managing AI risks and privacy engineering. They explore legal and ethical implications of AI, balance between utility and privacy, and the importance of safeguarding models and data in AI solutions. The conversation delves into memory, learning, legal frameworks, privacy concerns in large models, and Apple's business strategies in AI.
undefined
6 snips
Mar 15, 2024 • 59min

[Exclusive] Zilliz Roundtable // Why Purpose-built Vector Databases Matter for Your Use Case

Engineers from Zilliz discuss the importance of purpose-built vector databases for AI applications. They cover challenges with large language models and solutions for efficient retrieval tasks. The podcast also explores upcoming features in Millvis two four, including hybrid search capabilities and data management strategies in vector databases.
undefined
Mar 12, 2024 • 58min

A Decade of AI Safety and Trust // Petar Tsankov // MLOps Podcast #218

The podcast delves into AI safety and trust over the past decade, emphasizing the importance of reliability and transparency in deploying models. It explores the contrasting educational environments in the US and Switzerland, highlighting the journey of the speaker. Discussions cover challenges in ensuring trust in AI models, the impact of generative AI, and the need for comprehensive testing post-deployment to build trust.
undefined
14 snips
Mar 8, 2024 • 1h 10min

The Real E2E RAG Stack // Sam Bean, Rewind AI // #217

From discussing the Real E2E RAG Stack to addressing challenges in building RAG applications, the podcast delves into optimizing systems with DSPI and pipeline efficiency. The journey of complexity and optimization, along with emphasizing motivation and simplification in coding, provides valuable insights for AI and machine learning enthusiasts.
undefined
10 snips
Mar 5, 2024 • 51min

Managing Data for Effective GenAI Application // Anu Arora and Anass Bensrhir // #215

Explore the impact of GenAI on industries and challenges in scaling, data quality hindrances, and non-value-added tasks. Delve into the evolving role of data engineers, LLM integration, and GenAI tools for automation and data handling. Discuss risks with LLM models in AI applications, emphasizing data privacy, compliance, and decision-making strategies.
undefined
Mar 1, 2024 • 1h 15min

Becoming an AI Evangelist // Alex Volkov // #215

AI Evangelist Alex Volkov shares his journey from running AI models to founding an AI startup. Explore topics like multimodal transformer architecture for lucid dreaming, challenges in MLOps with video data, and evaluating AI models with 'vibe checks'. Learn about AI agents, automation in content creation, and fostering community engagement in AI development.
undefined
Feb 28, 2024 • 31min

LLM Use Cases in Production // AI in Production Conference // Panel 1

Experts discuss practical applications of Large Language Models in customer service, property management, and paper summarization. They explore AI's impact on product development, the importance of clear communication, and maximizing workflow efficiency for revenue growth.
undefined
4 snips
Feb 24, 2024 • 56min

Information Retrieval & Relevance // Daniel Svonava // #214

The podcast with Daniel Svonava discusses the use of vector embeddings in information retrieval, optimizing recommender systems with vector compute, customizing search vectors for relevance, and the efficiency of specialized models. It explores vector databases, deep learning-based retrieval challenges, and the transformative power of vector embeddings in diverse applications.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app