AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Reward Hacking, Specification Gaming, and Instrumental Convergence
This chapter explores the concepts of reward hacking, specification gaming, and instrumental convergence in AI agents. The speakers discuss how AI agents may seek power and accumulate resources, potentially posing risks to human interests.