
Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23
EAG Talks
00:00
AI Models, Attacks, Correlations, and Self-Preservation
Discusses the correlation between an AI model's knowledge and the attacks it can generate, the risks of deceptively aligned models, self-preservation theory, and alternative methods for improving models.