
Episode 63: Why Gemini 3 Will Change How You Build AI Agents with Ravin Kumar (Google DeepMind)
Vanishing Gradients
Evaluating agents: multi-part and component tests
They explain why evaluating agents is complex: it spans function-call correctness, retrieval quality, generation quality, and end-to-end tests.
Play episode from 33:39
Transcript


