AI Safety Fundamentals: Alignment

High-Stakes Alignment via Adversarial Training [Redwood Research Report]

May 13, 2023
Ask episode
Chapters
Transcript
Episode notes