589: The Correct Amount of Rocks
Accidental Tech Podcast
00:00
In-depth Analysis of AI Inference Performance Standards
This chapter examines trillions of operations per second (TOPS) as a measure of AI inference performance, focusing on figures quoted for 8-bit integer (INT8) operations by vendors such as Apple and Qualcomm. It explores how INT8 precision relates to neural processing units (NPUs) and compares the TOPS ratings of different products on that common basis.
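As a rough illustration of how a headline TOPS figure can be derived, the sketch below computes a theoretical peak from the number of multiply-accumulate (MAC) units and the clock rate. It assumes the common convention of counting each MAC as two operations. The function name `peak_tops` and the hardware figures are hypothetical and do not describe any Apple or Qualcomm product.

```python
def peak_tops(mac_units: int, clock_hz: float, ops_per_mac: int = 2) -> float:
    """Return a theoretical peak in trillions of operations per second.

    Assumes every MAC unit completes one multiply-accumulate per cycle,
    counted as `ops_per_mac` operations (2 by the usual convention).
    """
    return mac_units * ops_per_mac * clock_hz / 1e12


# Hypothetical NPU: 16,384 INT8 MAC units running at 1.2 GHz.
example_tops = peak_tops(16_384, 1.2e9)
print(f"{example_tops:.1f} TOPS (theoretical INT8 peak)")
```

A figure computed this way is an upper bound at one precision. Two ratings are only comparable when both are quoted at the same precision, which is why the INT8 basis matters when comparing products.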