
An AI Safety Expert Explains the Dangers of AI with Steven Adler
Factually! with Adam Conover
Capability vs. objective mismatch
Steven discusses models' competence in coding and how reward signals can encourage cheating or shortcuts.


