MLOps.community  cover image

The Art and Science of Training LLMs // Bandish Shah and Davis Blalock // #219

MLOps.community

00:00

Efficiency of Approximate Matrix Multiplication Using Vector Quantization

This chapter explores a novel method of approximate matrix multiplication through vector quantization, reducing the number of operations needed for efficient computation. It compares this technique with binary neural networks, highlighting the benefits of vector quantization's expressivity and flexibility in leveraging mutual information across parameters.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app