MLOps.community  cover image

MLOps.community

Latest episodes

undefined
24 snips
Dec 23, 2024 • 58min

Holistic Evaluation of Generative AI Systems // Jineet Doshi // #280

In this insightful discussion, Jineet Doshi, an award-winning AI lead with over seven years at Intuit, dives deep into the complexities of evaluating generative AI systems. He emphasizes the importance of holistic evaluation to foster trust and the unique challenges posed by large language models. Jineet explores diverse evaluation methods, from classic NLP techniques to innovative strategies like red teaming. He also tackles the financial nuances of generative AI and the balance between human insight and automated feedback for robust assessments.
undefined
24 snips
Dec 20, 2024 • 1h 15min

Unleashing Unconstrained News Knowledge Graphs to Combat Misinformation // Robert Caulk // #279

Robert Caulk, the founder of Emergent Methods and an expert in large-scale applications, discusses the cutting-edge development of unconstrained knowledge graphs to counter misinformation. He reveals how new tools allow for the processing of vast amounts of news data more efficiently. The podcast explores the integration of knowledge graphs with AI, enhancing user interaction and the fight against false narratives. Caulk emphasizes the ethical challenges of data handling and the role of advanced AI models in improving sentiment analysis, showcasing a future of responsible information management.
undefined
Dec 17, 2024 • 50min

LLM Distillation and Compression // Guanhua Wang // #278

Guanhua Wang, a Senior Researcher in the DeepSpeed team at Microsoft, dives into the revolutionary Domino training engine, designed to eliminate communication overhead during LLM training. He discusses the intricacies of naming the Phi-3 model and the growing interest in smaller language models. Wang highlights advanced techniques like data offloading and quantization, showcasing how Domino can speed up training by up to 1.3x compared to existing methods, while addressing privacy in customizable copilot models. It's a deep dive into optimizing AI training!
undefined
18 snips
Dec 11, 2024 • 58min

AI's Next Frontier // Aditya Naganath // #277

Aditya Naganath, an experienced investor at Kleiner Perkins, delves into AI's next frontier, focusing on the collaboration between AI and knowledge workers. He discusses the evolving landscape of AI investments, emphasizing the significance of strong teams and go-to-market strategies. The conversation also highlights the shift towards low-code and no-code tools, democratizing access to technology, and innovative challenges in AI infrastructure. Aditya provides insights into GPU reliability issues, economic dynamics in AI services, and the growing importance of inference in the AI space.
undefined
15 snips
Dec 4, 2024 • 57min

PyTorch for Control Systems and Decision Making // Vincent Moens // #276

Vincent Moens, an Applied Machine Learning Research Scientist at Meta and the author behind TorchRL and TensorDict, delves into the fascinating applications of PyTorch in control systems and decision-making. He shares insights on optimizing performance using practical tips, including the nuances of pin memory for CUDA transfers. The discussion covers the pitfalls of in-place tensor modifications and introduces TensorDict as a solution for efficient data handling. Additionally, Vincent emphasizes community collaboration to enhance developer experiences and improve user-friendly APIs in PyTorch.
undefined
10 snips
Nov 29, 2024 • 57min

AI-Driven Code: Navigating Due Diligence & Transparency in MLOps // Matt van Itallie // #275

In this engaging discussion, Matt van Itallie, founder and CEO of Sema, shares insights on the importance of comprehensive codebase scans for technical due diligence. He reveals how Generative AI is reshaping code transparency and introduces the Generative AI Bill of Materials (GBOM) for managing AI-generated code risks. Matt emphasizes the necessity of bridging technical and business viewpoints in AI evaluation, highlighting practical strategies for assessing cloud costs and optimizing code quality. His insights are invaluable for both technical and non-technical audiences.
undefined
7 snips
Nov 26, 2024 • 58min

PyTorch's Combined Effort in Large Model Optimization // Michael Gschwind // #274

Michael Gschwind, Director/Principal Engineer for PyTorch at Meta Platforms, shares his insights on AI advancements. He discusses the evolution from gaming hardware to modern AI, highlighting the pivotal role of community collaboration. The conversation covers the development of Torch Chat for large language models, energy-efficient optimization techniques, and the exciting shift toward on-device AI solutions. Gschwind also emphasizes strategic optimization to avoid premature pitfalls in technology development.
undefined
Nov 22, 2024 • 33min

LLMs to agents: The Beauty & Perils of Investing in GenAI // VC Panel // Agents in Production

Join Meera Clark, a Principal at Redpoint Ventures, Sandeep Bakshi, from Prosus Ventures, and George Robson of Sequoia Capital as they dissect the thrilling yet challenging realm of AI investments. They explore the exciting applications of large language models in various industries, the strategic pivots needed for new startups to compete with giants, and the evolving expectations of consumers. The panel sheds light on economic hurdles, scalability issues, and the vital role of sustainable business models in navigating the future of generative AI.
undefined
4 snips
Nov 20, 2024 • 51min

We Can All Be AI Engineers and We Can Do It with Open Source Models // Luke Marsden // #273

Luke Marsden, CEO of HelixML and a seasoned tech leader, dives into the world of open-source AI models. He discusses how anyone can become an AI engineer, emphasizing the practicality of building Generative AI applications. Luke elaborates on the advantages of open-source solutions for data privacy and business value. He also highlights the importance of structured specifications and customization in AI systems, making advanced features accessible for both technical and non-technical users. Join him for insights into the future of AI innovation!
undefined
Nov 15, 2024 • 29min

Exploring AI Agents: Voice, Visuals, and Versatility // Panel // Agents in Production

Jazmia Henry, Founder and CEO of Iso AI, joins a panel of AI experts to explore the future of AI agents. They discuss the integration of voice and visual interfaces, emphasizing how smaller language models can enhance usability. The panel highlights AI's transformative impact across industries like insurance and manufacturing, and the efficiency of deconstructing tasks for better performance. Challenges in scaling AI agents and the importance of quality data are also examined, paving the way for innovative solutions in the rapidly evolving AI landscape.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode