Addressing High-Stakes AI Safety Challenges

Exploring the crucial role of explicit assumptions in AI safety research, focusing on scalable oversight projects and the distinction between low and high-stakes contexts. Emphasizing the risks posed by untrusted models undermining safety measures intentionally, and the need for specialized techniques in high-stakes settings.

Transcript

Play full episode

Transcript

Episode notes

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app