MLOps.community

LLM Distillation and Compression // Guanhua Wang // #278

Dec 17, 2024
Guanhua Wang, a Senior Researcher in the DeepSpeed team at Microsoft, dives into the revolutionary Domino training engine, designed to eliminate communication overhead during LLM training. He discusses the intricacies of naming the Phi-3 model and the growing interest in smaller language models. Wang highlights advanced techniques like data offloading and quantization, showcasing how Domino can speed up training by up to 1.3x compared to existing methods, while addressing privacy in customizable copilot models. It's a deep dive into optimizing AI training!
49:47

Podcast summary created with Snipd AI

Quick takeaways

  • High-quality, noise-free data from reputable sources is crucial for training effective language models and is more effective than synthetic data.
  • Domino optimizes LLM training by minimizing communication overhead between GPUs, enabling faster training through tighter integration of communication with computation.

Deep dives

Innovations in Small Language Models

Building high-performing small language models depends on rigorous data quality and preprocessing. The discussion stresses that high-quality, noise-free data sourced from reputable publications such as the New York Times and Forbes is essential for training effective models. Such data is favored over synthetic data, which is often deemed insufficient in the variety and accuracy that model training requires. The need for customized data also becomes clear during post-training, where it is used to enhance overall performance.
