Guanhua Wang

Senior Researcher in the DeepSpeed team at Microsoft. His research focuses on large-scale LLM training and serving. Led the ZeRO++ project and contributed to Microsoft Phi-3 model training.

Best podcasts with Guanhua Wang

Ranked by the Snipd community

8 snips

Dec 17, 2024 • 50min

LLM Distillation and Compression // Guanhua Wang // #278

Guanhua Wang, a Senior Researcher in the DeepSpeed team at Microsoft, dives into the revolutionary Domino training engine, designed to eliminate communication overhead during LLM training. He discusses the intricacies of naming the Phi-3 model and the growing interest in smaller language models. Wang highlights advanced techniques like data offloading and quantization, showcasing how Domino can speed up training by up to 1.3x compared to existing methods, while addressing privacy in customizable copilot models. It's a deep dive into optimizing AI training!

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app