AI Safety Fundamentals

Specification Gaming: The Flip Side of AI Ingenuity

May 13, 2023
Exploring specification gaming in AI, the podcast delves into how systems may achieve objectives while deviating from intended outcomes, citing examples from historical myths to modern scenarios. It highlights the challenges in reward function design and the risks of misspecification in AI, emphasizing the need for accurate task definitions and principled approaches to address specification challenges.
Ask episode
Chapters
Transcript
Episode notes