
Why the AI Race Undermines Safety (with Steven Adler)
Future of Life Institute Podcast
Methods to mitigate evaluation awareness
Adler surveys realism filtering, chain-of-thought inspection, and interpretability, noting limits and brittleness.


