MLOps.community

Demetrios
10 snips
Nov 29, 2024 • 57min

AI-Driven Code: Navigating Due Diligence & Transparency in MLOps // Matt van Itallie // #275

In this engaging discussion, Matt van Itallie, founder and CEO of Sema, shares insights on the importance of comprehensive codebase scans for technical due diligence. He reveals how Generative AI is reshaping code transparency and introduces the Generative AI Bill of Materials (GBOM) for managing AI-generated code risks. Matt emphasizes the necessity of bridging technical and business viewpoints in AI evaluation, highlighting practical strategies for assessing cloud costs and optimizing code quality. His insights are invaluable for both technical and non-technical audiences.
17 snips
Nov 26, 2024 • 58min

PyTorch's Combined Effort in Large Model Optimization // Michael Gschwind // #274

Michael Gschwind, Director/Principal Engineer for PyTorch at Meta Platforms, shares his insights on AI advancements. He discusses the evolution from gaming hardware to modern AI, highlighting the pivotal role of community collaboration. The conversation covers the development of Torch Chat for large language models, energy-efficient optimization techniques, and the exciting shift toward on-device AI solutions. Gschwind also emphasizes strategic optimization to avoid premature pitfalls in technology development.
Nov 22, 2024 • 33min

LLMs to agents: The Beauty & Perils of Investing in GenAI // VC Panel // Agents in Production

Join Meera Clark, a Principal at Redpoint Ventures; Sandeep Bakshi of Prosus Ventures; and George Robson of Sequoia Capital as they dissect the thrilling yet challenging realm of AI investments. They explore the exciting applications of large language models in various industries, the strategic pivots needed for new startups to compete with giants, and the evolving expectations of consumers. The panel sheds light on economic hurdles, scalability issues, and the vital role of sustainable business models in navigating the future of generative AI.
4 snips
Nov 20, 2024 • 51min

We Can All Be AI Engineers and We Can Do It with Open Source Models // Luke Marsden // #273

Luke Marsden, CEO of HelixML and a seasoned tech leader, dives into the world of open-source AI models. He discusses how anyone can become an AI engineer, emphasizing the practicality of building Generative AI applications. Luke elaborates on the advantages of open-source solutions for data privacy and business value. He also highlights the importance of structured specifications and customization in AI systems, making advanced features accessible for both technical and non-technical users. Join him for insights into the future of AI innovation!
Nov 15, 2024 • 29min

Exploring AI Agents: Voice, Visuals, and Versatility // Panel // Agents in Production

Jazmia Henry, Founder and CEO of Iso AI, joins a panel of AI experts to explore the future of AI agents. They discuss the integration of voice and visual interfaces, emphasizing how smaller language models can enhance usability. The panel highlights AI's transformative impact across industries like insurance and manufacturing, and the efficiency of deconstructing tasks for better performance. Challenges in scaling AI agents and the importance of quality data are also examined, paving the way for innovative solutions in the rapidly evolving AI landscape.
Nov 13, 2024 • 1h 8min

The Impact of UX Research in the AI Space // Lauren Kaplan // #272

In this engaging conversation, Lauren Kaplan, a sociologist with a PhD from Goethe University Frankfurt and former UX researcher at Meta, shares her insights into the pivotal role of UX research in shaping AI initiatives. She delves into the challenges of bias in research, emphasizing the importance of unbiased data collection. Kaplan discusses collaborative strategies between UX researchers and ML engineers to enhance user-centric AI tools. Highlighting the necessity for effective communication and alignment within organizations, she champions structured research to drive innovation and user satisfaction.
5 snips
Nov 1, 2024 • 59min

EU AI Act - Navigating New Legislation // Petar Tsankov // MLOps Podcast #271

Petar Tsankov, Co-founder and CEO of LatticeFlow AI, dives into the complexities of the EU AI Act and its impact on AI innovation. He discusses the importance of translating legislation into practical technical requirements. Petar introduces 'Comply,' an open-source tool for AI compliance, while emphasizing the need for robust benchmarks in AI safety. He also sheds light on managing AI risks and the collaboration required among stakeholders to navigate evolving regulations, making it essential listening for AI developers and businesses alike.
7 snips
Oct 22, 2024 • 55min

Boosting LLM/RAG Workflows & Scheduling w/ Composable Memory and Checkpointing // Bernie Wu // #270

Bernie Wu, VP of Strategic Partnerships at MemVerge, brings over 25 years of experience in data infrastructure. He discusses the critical role of innovative memory solutions in optimizing Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) workflows. The conversation covers the advantages of composable memory in alleviating performance limits, efficient resource scheduling, and overcoming GPU challenges. Bernie also touches on the importance of collaboration tools for better memory management and advances in GPU networking technologies that are shaping the future of AI.
91 snips
Oct 18, 2024 • 1h 2min

How to Systematically Test and Evaluate Your LLMs Apps // Gideon Mendels // #269

Gideon Mendels, CEO and co-founder of Comet, dives into the intricate world of testing and evaluating LLMs. He discusses the hybrid approach required for these applications, merging machine learning with software engineering best practices. Topics include innovative methods for evaluating LLMs beyond traditional metrics, the challenge of unit testing with deterministic assertions, and the importance of experiment tracking in ensuring reproducibility. Gideon also highlights the role of user interaction analysis in enhancing LLM applications' performance.
13 snips
Oct 15, 2024 • 51min

Exploring the Impact of Agentic Workflows // Raj Rikhy // #268

In this engaging discussion, Raj Rikhy, a Senior Product Manager at Microsoft AI + R, shares insights on deploying AI agents effectively. He highlights the importance of starting small with clear success criteria while maintaining human oversight to manage AI unpredictability. Raj dives into real-time applications like fraud detection and supply chain optimization, emphasizing the efficiency gains from agentic workflows. He also compares this transformative technology to innovations like the iPhone, encouraging listeners to embrace the future of AI.
