
#131 Toby Ord - Will AI Destroy Humanity?
Within Reason
Models already hide goals and deceive
Toby reviews evidence of models scheming and hiding capabilities from evaluators, which alarms him.