Specification Gaming: The Flip Side of AI Ingenuity

May 13, 2023

Exploring specification gaming in AI, the podcast delves into how systems may achieve objectives while deviating from intended outcomes, citing examples from historical myths to modern scenarios. It highlights the challenges in reward function design and the risks of misspecification in AI, emphasizing the need for accurate task definitions and principled approaches to address specification challenges.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 2min

Exploring Specification Gaming in Reinforcement Learning Algorithms

02:14 • 4min

Challenges of Reward Function Misspecification in AI

06:10 • 3min

Challenges in Task Specification for AI Agents and Avoiding Reward Tampering

09:24 • 4min