AXRP - the AI X-risk Research Podcast

34 - AI Evaluations with Beth Barnes

43 snips
Jul 28, 2024
Beth Barnes, the founder and head of research at METR, dives into the complexities of evaluating AI systems. They discuss tailored threat models and the unpredictability of AI performance, stressing the need for precise assessment methodologies. Barnes highlights issues like sandbagging and behavior misrepresentation, emphasizing the importance of ethical considerations in AI evaluations. The conversation also touches on the role of policy in shaping effective evaluation science, as well as the disparities between different AI labs in security and monitoring.
Ask episode
Chapters
Transcript
Episode notes