MLOps.community  cover image

The Art and Science of Training LLMs // Bandish Shah and Davis Blalock // #219

MLOps.community

CHAPTER

Efficiency of Approximate Matrix Multiplication Using Vector Quantization

This chapter explores a novel method of approximate matrix multiplication through vector quantization, reducing the number of operations needed for efficient computation. It compares this technique with binary neural networks, highlighting the benefits of vector quantization's expressivity and flexibility in leveraging mutual information across parameters.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner