MLOps.community  cover image

MLOps.community

Latest episodes

undefined
Jun 14, 2024 • 57min

How to Build Production-Ready AI Models for Manufacturing // [Exclusive] LatticeFlow Roundtable

Discussion on challenges in deploying AI models in manufacturing, optimizing models for corner cases, transitioning models to production, and exploring trust in traditional ML vs LLMs. Special guests share insights on managing battery health, semiconductor manufacturing, and maintaining reliability and customer trust in industrial settings.
undefined
Jun 11, 2024 • 58min

From Robotics to Recommender Systems // Miguel Fierro // #240

Miguel Fierro, Principal Data Science Manager at Microsoft, discusses the challenges of applying ML in robotics and the integration of computer vision in sports analytics. He highlights the role of AI in strategic game analysis and explores the evolution of recommendation systems, emphasizing the importance of real-time architectures for personalized recommendations.
undefined
Jun 7, 2024 • 36min

Uber's Michelangelo: Strategic AI Overhaul and Impact // #239

Demetrios Brinkmann, AI strategist at Uber, discusses the evolution of Michelangelo platform at Uber, from basic ML predictions to deep learning and generative AI. Covering challenges faced in early versions and improvements in Michelangelo 2.0 and 3.0 like Pytorch support, enhanced model training, and integration of technologies like Nvidia’s Triton and Kubernetes. The platform now includes features like a Genai gateway, compliance guardrails, and model performance monitoring to streamline AI operations.
undefined
Jun 4, 2024 • 45min

AWS Tranium and Inferentia // Kamran Khan and Matthew McClean // #238

Join Kamran Khan and Matthew McClean as they discuss AWS Trainium and Inferentia, powerful AI accelerators offering enhanced performance and cost savings. They delve into integration with PyTorch, JAX, and Hugging Face, along with support from industry leaders like W&B. Explore the evolution and performance comparison of these AI chips, flexibility in model training with Trainium, and workflow integration with SageMaker. Discover the distinctions between inference and training on accelerators and explore AWS services for generative AI.
undefined
May 31, 2024 • 47min

Build Reliable Systems with Chaos Engineering // Benjamin Wilms // #237

Benjamin Wilms, a chaos and resilience engineering expert, discusses integrating Chaos Engineering into the CI/CD pipeline for system resilience. They explore the cultural shift needed to embrace failures as learning opportunities and the transition to structured chaos engineering experiments. The conversation also covers reflection on errors, AWS's Trainium, chaos engineering methods, and the intersection of chaos engineering and observability for reliable systems.
undefined
May 28, 2024 • 1h 5min

Managing Small Knowledge Graphs for Multi-agent Systems // Tom Smoker // #236

Tom Smoker, Cofounder of WhyHow.ai, discusses using knowledge graphs in multi-agent systems. Topics include mitigating hallucination issues, optimizing search with knowledge graphs, agile problem-solving, and integrating vector databases. The conversation explores agents in multi-agent systems, stepping back for growth, and using AI models for automated content creation and revenue generation.
undefined
May 27, 2024 • 1h 2min

Just when we Started to Solve Software Docs, AI Blew Everything Up // Dave Nunez // #235

Dave Nunez, Partner at Abstract Group, discusses how AI is changing developer documentation strategies. He emphasizes the need to rewrite the developer education playbook for AI-focused software. Topics include enhancing user experience with design cues, escape patches, and intuitive documentation. The importance of clear, user-friendly content and effective onboarding experiences is highlighted.
undefined
May 21, 2024 • 46min

Open Standards Make MLOps Easier and Silos Harder // Cody Peterson // #234

Cody Peterson, Senior Technical Product Manager at Voltron Data, discusses the importance of open standards in MLOps. Topics include challenges with scalability in data tools like Pandas, leveraging the Ibis project for big data processing, and the power of Apache Arrow in data systems. The conversation also covers transitioning between platforms, considerations for data system selection, and future plans for the Ibis project.
undefined
May 17, 2024 • 44min

Retrieval Augmented Generation

Syed Asad, an Innovator and AI Engineer, discusses Retrieval Augmented Generation (RAG), Semantic Vector Searches, and Vector Databases reshaping data landscapes. Topics include AI model deployment complexities, AI evaluation frameworks, challenges in client approval, and struggles with data ingestion in AI environments.
undefined
May 16, 2024 • 50min

RecSys at Spotify // Sanket Gupta // #232

Senior Machine Learning Engineer at Spotify, Sanket Gupta, discusses foundational embeddings for transfer learning in recommender systems. Topics include large-scale recommender system building, transfer learning with user and item embeddings, system evaluation, and MLOps challenges. They explore music recommendation intricacies, user behavior analysis challenges, and balancing real-time recommendations with scalability. The podcast delves into user representations, cross-content embeddings, and maintaining content freshness for optimal user experiences.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode