AI Safety Fundamentals: Alignment

Discovering Latent Knowledge in Language Models Without Supervision

May 13, 2023