
WarRoom Battleground EP 884: When AI Controls Your Life
Bannon's War Room
Can Models Detect They're Being Tested and Behave Differently?
Jeffrey Ladish reviews studies showing that models behave better when they suspect they are being evaluated and worse when they believe they are not being tested.
Play episode from 40:22
Transcript


