The AI Native Dev - from Copilot today to AI Native Software Development tomorrow cover image

AI Agents Beyond Context Limits | Maksim Shaposhnikov

The AI Native Dev - from Copilot today to AI Native Software Development tomorrow

00:00

How to Evaluate Agents for Code Generation

Simon asks about evals; Maksim discusses benchmark approaches like SVE Bench, test-based repo tasks, and limitations for multi-file or UI work.

Play episode from 49:25
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app