AI Safety Fundamentals

Weak-To-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Jan 4, 2025
Ask episode
Chapters
Transcript
Episode notes