AI Safety via Debate

AI Safety Fundamentals: Alignment

Theorem 1: The Complexity Class Σ₂P

There exists an x such that for all y, H(q, x, y) is true. Alice wins if she can find an x such that all responses y by Bob have H(q, x, y) equal to 1. This complexity class is Σ₂P, two steps up the polynomial hierarchy. For polynomial-time H, we can continue this process for any number of rounds, with Alice and Bob alternating points and counterpoints. With polynomially many rounds this reaches PSPACE: all questions decidable by polynomial-space algorithms (Sipser, 2013).
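
The alternating game can be illustrated with a small brute-force search. This is only a sketch under stated assumptions: the function name `alice_wins`, the finite move set `moves`, and the toy judges are hypothetical and do not come from the episode. Alice's turns act as "there exists" and Bob's turns as "for all", and the judge H is modeled as a function of the question q and the full transcript of moves.

```python
from typing import Callable, Sequence, Tuple

Transcript = Tuple[int, ...]
Judge = Callable[[int, Transcript], bool]  # hypothetical stand-in for H


def alice_wins(judge: Judge, q: int, moves: Sequence[int],
               rounds: int, transcript: Transcript = ()) -> bool:
    """Return True if Alice can force the judge to output True.

    Alice moves on even turns (existential choice), Bob on odd turns
    (universal choice). The search is depth-first, so the memory it
    needs grows with the number of rounds, not with the tree size.
    """
    if len(transcript) == rounds:
        return judge(q, transcript)
    outcomes = (
        alice_wins(judge, q, moves, rounds, transcript + (m,))
        for m in moves
    )
    alice_to_move = len(transcript) % 2 == 0
    return any(outcomes) if alice_to_move else all(outcomes)


moves = range(3)

# Two rounds: "exists x, for all y, H(q, x, y)".
# Alice can pick x = 0, so every reply y gives x * y == 0.
product_is_zero = alice_wins(lambda q, t: t[0] * t[1] == 0, 0, moves, 2)

# Bob can always pick y != x, so Alice cannot force x == y.
moves_match = alice_wins(lambda q, t: t[0] == t[1], 0, moves, 2)

# Three rounds (Alice, Bob, Alice): Alice moves last and can always
# fix the sum modulo 3, whatever Bob played.
sum_hits_q = alice_wins(lambda q, t: sum(t) % 3 == q, 1, moves, 3)
```

With two rounds the search decides exactly the exists-for-all statement in the passage; raising `rounds` adds further alternations, at an exponential cost in running time for this naive version.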
