undefined

Zach Stein-Perlman

Author of the LessWrong post "METR: Measuring AI Ability to Complete Long Tasks." The post discusses measuring AI performance based on the length of tasks AI agents can complete.