undefined

John Yang

Researcher and benchmark author focused on code evaluation and long-horizon AI coding agents; creator of SWE-bench and CodeClash and a Stanford PhD student working on human–AI collaboration and code evals.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app