
The fastest agent in the race has the best evals
The Stack Overflow Podcast
00:00
How Groq achieves faster, cheaper inference
Benjamin describes Groq's LPU chip, RockCloud, and software optimizations like speculative decoding to speed inference.
Play episode from 07:35
Transcript


