AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Introduction
Exploring the Machiavelli benchmark paper evaluating power seeking and deception in language model agents, with a focus on realistic testing environments and tracking deceptive behavior instances.