
Rethinking Model Size: Train Large, Then Compress with Joseph Gonzalez - #378
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Optimizing Model Performance with Serverless Computing
This chapter explores the intersection of economics and computing in serverless environments, focusing on strategies for efficient model training and experimentation. It discusses compression techniques such as weight pruning and quantization, which improve inference efficiency and reduce cost as machine learning models are scaled up.
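The two compression techniques mentioned above can be illustrated with a minimal sketch. The helper names below (`magnitude_prune`, `quantize_int8`) are hypothetical, not from the episode; they show one common form of each idea: zeroing out the smallest-magnitude weights, and symmetric uniform quantization of floats to 8-bit integers with a scale factor for dequantization.

```python
import numpy as np

def magnitude_prune(weights, sparsity=0.5):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude.

    Hypothetical helper illustrating magnitude-based weight pruning.
    """
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    if k == 0:
        return weights.copy()
    # Threshold = k-th smallest absolute value; everything at or below it is pruned.
    threshold = np.partition(flat, k - 1)[k - 1]
    return np.where(np.abs(weights) <= threshold, 0.0, weights)

def quantize_int8(weights):
    """Symmetric uniform quantization to int8.

    Returns the quantized integers and the scale needed to dequantize.
    """
    scale = np.max(np.abs(weights)) / 127.0
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

# Toy weight matrix standing in for a trained layer.
w = np.array([[0.9, -0.05], [0.01, -1.2]])
pruned = magnitude_prune(w, sparsity=0.5)   # half the weights set to zero
q, scale = quantize_int8(pruned)            # stored as int8 + one float scale
deq = q.astype(np.float32) * scale          # approximate reconstruction
```

Pruning shrinks the model by making weights sparse, while quantization shrinks the bytes per weight (here from 32-bit floats to 8-bit integers); the two are complementary and often applied together after training a large model.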