LessWrong (30+ Karma)

“ImpossibleBench: Measuring Reward Hacking in LLM Coding Agents” by Ziqian Zhong

Oct 30, 2025
Ask episode
Chapters
Transcript
Episode notes