
34 - AI Evaluations with Beth Barnes
AXRP - the AI X-risk Research Podcast
00:00
Navigating AI Evaluations: The Role of METR and Labs
This chapter explores the relationship between METR and AI labs, clarifying their respective roles in third-party evaluations that stop short of formal audits. It highlights the challenges of communication and transparency in the evaluation process, particularly where non-disclosure and non-disparagement agreements are involved. The discussion emphasizes the need for industry standards and refined metrics to improve trust and accountability in AI assessments.