MLOps.community  cover image

MLOps.community

Latest episodes

undefined
37 snips
May 17, 2024 • 44min

Retrieval Augmented Generation

Syed Asad, an Innovator and AI Engineer, discusses Retrieval Augmented Generation (RAG), Semantic Vector Searches, and Vector Databases reshaping data landscapes. Topics include AI model deployment complexities, AI evaluation frameworks, challenges in client approval, and struggles with data ingestion in AI environments.
undefined
May 16, 2024 • 50min

RecSys at Spotify // Sanket Gupta // #232

Senior Machine Learning Engineer at Spotify, Sanket Gupta, discusses foundational embeddings for transfer learning in recommender systems. Topics include large-scale recommender system building, transfer learning with user and item embeddings, system evaluation, and MLOps challenges. They explore music recommendation intricacies, user behavior analysis challenges, and balancing real-time recommendations with scalability. The podcast delves into user representations, cross-content embeddings, and maintaining content freshness for optimal user experiences.
undefined
18 snips
May 10, 2024 • 58min

From A Coding Startup to AI Development in the Enterprise // Ryan Carson // #231

CEO and Founder Ryan Carson discusses democratizing AI development and the impact of new technologies like Gaudi three. He emphasizes the importance of aligning individual work with company goals and the potential benefits of AI in professional interactions. The conversation also touches on practical AI applications, entrepreneurship, and the role of technology in shaping the future.
undefined
May 7, 2024 • 53min

FedML Nexus AI: Your Generative AI Platform at Scale // Salman Avestimehr // #230

Salman Avestimehr, CEO & Founder of FEDML, discusses FEDML Nexus AI, an enterprise AI platform enabling generative AI applications at scale. Topics include challenges in AI platform development, ownership, scalability, integrating with cloud infrastructure, evaluating language models, small scale foundation models in federated learning, and advantages of using small models for AI agents on mobile phones.
undefined
May 3, 2024 • 46min

What is AI Quality? // Mohamed Elgendy // #228

Mohamed Elgendy, Co-Founder & CEO at Kolena, discusses AI Quality with a focus on tailored quality standards, risk management, and edge deployments. The podcast emphasizes the formation of gold standards for AI, collaboration among AI builders, regulators, and infrastructure firms, and the need for diversification in the tech industry. Elgendy's AI Quality Conference aims to set effective but innovation-friendly quality standards for AI.
undefined
8 snips
Apr 30, 2024 • 56min

Handling Multi-Terabyte LLM Checkpoints // Simon Karasik // #228

Simon Karasik, an experienced ML Engineer, discusses handling multi-terabyte LLM checkpoints. Topics include managing massive models, cloud storage options, comparing Slurm and Kubernetes, navigating data processing challenges, monitoring Kubernetes nodes with faulty GPUs, and simplifying model training processes.
undefined
18 snips
Apr 26, 2024 • 43min

Leading Enterprise Data Teams // Sol Rashidi // #227

Sol Rashidi, an esteemed executive in AI and Data, discusses the importance of prioritizing relationships, a 'Wrong Use Cases Formula' for project prioritization, and effective communication in data leadership. She shares insights on balancing criticality and complexity in project prioritization, evaluating team skills, navigating data complexity, adapting to change in company culture, and upcoming AI quality conference.
undefined
56 snips
Apr 23, 2024 • 58min

The Rise of Modern Data Management // Chad Sanderson // #226

Chad Sanderson, CEO of Gable.ai, discusses modern data management, data contracts, and the evolving trends in data infrastructure. He explores the importance of aligning data use with business value, transitioning to federated data management models, and the benefits of AWS Trainium and Inferentia. Sanderson introduces 'Gable' as a platform revolutionizing data management processes with live change detection and collaboration tools, emphasizing data integrity and monetization for businesses.
undefined
Apr 19, 2024 • 54min

Beyond AGI, Can AI Help Save the Planet? // Patrick Beukema // #225

Neuroscientist Patrick Beukema discusses the role of AI in solving environmental challenges, combining remote sensing and AI for real-time global intelligence. He emphasizes the importance of MLOps in ML/AI workflows for continual improvement. Join the fight for sustainability and conservation with advancements in Environmental AI for social good.
undefined
6 snips
Apr 17, 2024 • 49min

GenAI in Production - Challenges and Trends // Verena Weber // #224

Verena Weber, with expertise in NLP and 7+ years in ML, discusses challenges & trends in GenAI: model size, context windows, multimodality, EU AI Act. She compares Gemini 1.0 & 1.5. Topics include AI models in production, BERT, GPT, and empowering women in tech.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app