589: The Correct Amount of Rocks
Accidental Tech Podcast
In-depth Analysis of AI Inference Performance Standards
The chapter delves into the measurement standard of trillions of operations per second (TOPS) in AI inference performance, specifically focusing on the use of 8-bit integers across tech products like Apple and Qualcomm. It explores the relevance of int 8 precision to neural processing units (NPUs) and compares TOPS between different products to highlight their performance based on this standard.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.