Get the app
Markus Nagel
Research scientist at Qualcomm AI Research, focusing on machine learning efficiency and inference efficiency techniques like quantization and pruning. Presented research at NeurIPS 2023.
Best podcasts with Markus Nagel
Ranked by the Snipd community
9 snips
Dec 26, 2023
• 47min
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663
chevron_right
In this discussion, Markus Nagel, a research scientist at Qualcomm AI Research, shares insights from his recent papers at NeurIPS 2023, focusing on machine learning efficiency. He tackles the challenges of quantizing transformers, particularly in minimizing outlier issues in attention mechanisms. The conversation explores the pros and cons of pruning versus quantization for model weight compression and dives into innovative methods for multitask and multidomain learning. Additionally, the use of geometric algebra in enhancing algorithms for robotics is highlighted.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app