MLOps.community

Demetrios
Jan 24, 2025 • 53min

Collective Memory for AI on Decentralized Knowledge Graph // Tomaž Levak // #285

Tomaž Levak, Co-founder and CEO of Trace Labs, dives into the world of decentralized knowledge graphs and their role in AI. He discusses how these graphs enhance data integrity and privacy while promoting collaboration among organizations. Practical use cases in enterprise sectors are highlighted, showcasing their economic potential. Levak also explores the fusion of AI and personal health management, emphasizing innovative technologies that improve well-being. The conversation concludes with insights on the future of decentralized AI and its convergence with blockchain.
8 snips
Jan 17, 2025 • 52min

Efficient Deployment of Models at the Edge // Krishna Sridhar // #284

In this engaging discussion, Krishna Sridhar, an engineering leader at Qualcomm and former co-founder of Tetra AI, dives into the efficient deployment of AI models at the edge. He shares insights on using Qualcomm AI Hub to optimize models for on-device performance, highlighting its application in real-time sports tracking and mobile photography. Krishna also explores the balance between hardware and software optimization in modern devices. Plus, he reveals how innovations in edge computing are transforming everyday AI applications while ensuring user privacy.
46 snips
Jan 15, 2025 • 47min

Real World AI Agent Stories // Zach Wallace // #283

Zach Wallace, a Staff Software Engineer at Nearpod Inc., shares his expertise in AI integration within e-commerce and edtech. He discusses how AI agents enhance personalized user targeting and streamline data with tools like Redshift and dbt. The conversation delves into the challenges of maintaining AI systems, ensuring data quality, and the balance between specialization and cost in agent performance. Zach emphasizes the transformative potential of LLMs in education and the importance of educator involvement for effective AI tool development.
36 snips
Jan 8, 2025 • 1h 5min

Machine Learning, AI Agents, and Autonomy // Egor Kraev // #282

Egor Kraev, Principal AI Scientist at Wise Plc and founder of the Swiss Pirate Party, dives into the transformative power of AI in fintech. He shares insights on integrating large language models into machine learning pipelines and the practical implications of his open-source MotleyCrew framework. Egor highlights the role of AI in improving fraud detection and optimizing currency flow. He also discusses the importance of autonomy within teams, navigating causal inference in marketing, and enhancing user engagement through targeted campaigns.
Jan 3, 2025 • 51min

Re-Platforming Your Tech Stack // Michelle Marie Conway & Andrew Baker // #281

In this discussion, Michelle Marie Conway, Lead Data Scientist at Lloyds Banking Group, and Andrew Baker, Data Science Delivery Lead, share insights from their cloud migration journey. They delve into the transition from on-prem technology to the cloud, highlighting the complexities of model management and engineering practices. Their conversation also touches on the harmony between music and technology, the challenges of chaos engineering in regulated environments, and the importance of collaboration within data science and platform teams.
27 snips
Dec 23, 2024 • 58min

Holistic Evaluation of Generative AI Systems // Jineet Doshi // #280

In this insightful discussion, Jineet Doshi, an award-winning AI lead with over seven years at Intuit, dives deep into the complexities of evaluating generative AI systems. He emphasizes the importance of holistic evaluation to foster trust and the unique challenges posed by large language models. Jineet explores diverse evaluation methods, from classic NLP techniques to innovative strategies like red teaming. He also tackles the financial nuances of generative AI and the balance between human insight and automated feedback for robust assessments.
24 snips
Dec 20, 2024 • 1h 15min

Unleashing Unconstrained News Knowledge Graphs to Combat Misinformation // Robert Caulk // #279

Robert Caulk, the founder of Emergent Methods and an expert in large-scale applications, discusses the cutting-edge development of unconstrained knowledge graphs to counter misinformation. He reveals how new tools allow for the processing of vast amounts of news data more efficiently. The podcast explores the integration of knowledge graphs with AI, enhancing user interaction and the fight against false narratives. Caulk emphasizes the ethical challenges of data handling and the role of advanced AI models in improving sentiment analysis, showcasing a future of responsible information management.
8 snips
Dec 17, 2024 • 50min

LLM Distillation and Compression // Guanhua Wang // #278

Guanhua Wang, a Senior Researcher on the DeepSpeed team at Microsoft, dives into the Domino training engine, designed to eliminate communication overhead during LLM training. He discusses the intricacies of naming the Phi-3 model and the growing interest in smaller language models. Wang highlights advanced techniques like data offloading and quantization, showcasing how Domino can speed up training by up to 1.3x compared to existing methods, while addressing privacy in customizable copilot models.
18 snips
Dec 11, 2024 • 58min

AI's Next Frontier // Aditya Naganath // #277

Aditya Naganath, an experienced investor at Kleiner Perkins, delves into AI's next frontier, focusing on the collaboration between AI and knowledge workers. He discusses the evolving landscape of AI investments, emphasizing the significance of strong teams and go-to-market strategies. The conversation also highlights the shift towards low-code and no-code tools, democratizing access to technology, and innovative challenges in AI infrastructure. Aditya provides insights into GPU reliability issues, economic dynamics in AI services, and the growing importance of inference in the AI space.
15 snips
Dec 4, 2024 • 57min

PyTorch for Control Systems and Decision Making // Vincent Moens // #276

Vincent Moens, an Applied Machine Learning Research Scientist at Meta and the author behind TorchRL and TensorDict, delves into the applications of PyTorch in control systems and decision-making. He shares practical tips for optimizing performance, including the nuances of pinned memory for CUDA transfers. The discussion covers the pitfalls of in-place tensor modifications and introduces TensorDict as a solution for efficient data handling. Additionally, Vincent emphasizes community collaboration to improve the developer experience and make PyTorch APIs more user-friendly.
