
Emergent Deception and Emergent Optimization
AI Safety Fundamentals
00:00
Introduction
This chapter explores the potential negative consequences of emergent capabilities, such as deception and optimization, in machine learning systems, and introduces principles for reasoning about those consequences.