AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Challenges of Quantization for Large Models
The two key challenges were one, being able to work with this large model using your existing kind of tool chain and workflows. And then the second is some more specific challenges that arose in trying to achieve the desired performance levels. Did the ultimate solution from a quantization perspective was this kind of an out of the box push button application of the tool chain or was there a degree of research effort involved in achieving the result? We actually created the open source toolkit is called a emit, called AI model efficiency toolkit.