
Nate Soares on Why AI Could Kill Us All
The Good Fight
00:00
Current Alignment Efforts and Limits
Nate describes interpretability and evaluation work, praising researchers but warning of insufficiency versus knowing mechanisms.
Play episode from 01:21:41
Transcript


