Get the app
Guanhua Wang
Senior Researcher in the DeepSpeed team at Microsoft. His research focuses on large-scale LLM training and serving. Led the ZeRO++ project and contributed to Microsoft Phi-3 model training.
Best podcasts with Guanhua Wang
Ranked by the Snipd community
8 snips
Dec 17, 2024
• 50min
LLM Distillation and Compression // Guanhua Wang // #278
chevron_right
Guanhua Wang, a Senior Researcher in the DeepSpeed team at Microsoft, dives into the revolutionary Domino training engine, designed to eliminate communication overhead during LLM training. He discusses the intricacies of naming the Phi-3 model and the growing interest in smaller language models. Wang highlights advanced techniques like data offloading and quantization, showcasing how Domino can speed up training by up to 1.3x compared to existing methods, while addressing privacy in customizable copilot models. It's a deep dive into optimizing AI training!
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app