AI Safety via Debate

AI Safety Fundamentals: Alignment

Debate in a Game of Go

We ask Alice to compute π(x) and π(x/2). Bob can point out which range [a, b] is a lie; Alice must then justify herself by computing π((a + b)/2). We iterate until Alice and Bob are making different claims about a single step of simulation. H then checks p for primality to determine who wins.
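
The narrowing procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the episode's or the paper's exact protocol: Alice is modelled as a function that reports how many primes lie in a half-open range (a, b], Bob is assumed to know the true counts, and the names `is_prime`, `true_count`, `lying_alice`, and `debate` are all hypothetical. The judge H does nothing except one primality check at the end.

```python
from typing import Callable

# Hypothetical type: a debater's claimed number of primes p with a < p <= b.
Claim = Callable[[int, int], int]


def is_prime(n: int) -> bool:
    """Trial division; stands in for the judge H's single cheap check."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def true_count(a: int, b: int) -> int:
    """Number of primes p with a < p <= b (what an honest debater reports)."""
    return sum(is_prime(n) for n in range(a + 1, b + 1))


def lying_alice(a: int, b: int) -> int:
    """A dishonest debater who consistently pretends 91 = 7 * 13 is prime."""
    return true_count(a, b) + (1 if a < 91 <= b else 0)


def debate(alice: Claim, a: int, b: int) -> str:
    """Bisect (a, b] until one integer is left; return the winner's name."""
    claimed = alice(a, b)
    while b - a > 1:
        mid = (a + b) // 2
        left, right = alice(a, mid), alice(mid, b)
        if left + right != claimed:
            return "Bob"  # Alice's sub-claims contradict her earlier claim
        # Bob (assumed to know the truth) disputes a half that is wrong;
        # if the left half is correct he is forced to dispute the right one.
        if left != true_count(a, mid):
            b, claimed = mid, left
        else:
            a, claimed = mid, right
    # Only the integer b remains: H checks it for primality.
    return "Alice" if claimed == int(is_prime(b)) else "Bob"


print(debate(true_count, 0, 100))   # honest Alice
print(debate(lying_alice, 0, 100))  # Alice with one planted lie
```

In this sketch the honest debater survives every challenge, while the planted lie about 91 is cornered into the range (90, 91], where a single primality check settles the debate. The judge never counts primes; the number of rounds grows only logarithmically with the width of the range.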