How AI Is Built cover image

Embedding Intelligence: AI's Move to the Edge

How AI Is Built

00:00

Shrinking Neural Networks: Strategies and Innovations

This chapter explores methods for reducing the size of neural networks, including weight quantization and model distillation. The speaker shares practical experiences in training smaller models and discusses recent innovations like the 'moonshine' text-to-speech model, while expressing skepticism about commonly used techniques and highlighting challenges in quantization.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app