AI Safety via Debate

AI Safety Fundamentals: Alignment

CHAPTER

Debate in a Game of Go

We ask Alice to compute π(x) and π(x/2), where π(n) is the number of primes up to n. Bob can point out which range [a, b] contains a lie; Alice must then justify herself by computing π((a + b)/2). We iterate until Alice and Bob are making different claims about a single step of the simulation, which here is a single number p. H then checks p for primality to determine who wins.
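The bisection described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the protocol as the episode or the underlying paper specifies it: Alice and Bob are each modelled as a function returning their claimed value of π(n), Bob disputes whichever half of the current range the two disagree on, and the names `debate`, `true_pi`, and `is_prime` are hypothetical helpers introduced here.

```python
from typing import Callable

def is_prime(n: int) -> bool:
    """The judge H's cheap check: trial division on a single number."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True

def true_pi(n: int) -> int:
    """Honest (slow) prime count, used here to model an honest debater."""
    return sum(1 for k in range(2, n + 1) if is_prime(k))

def debate(x: int, alice: Callable[[int], int], bob: Callable[[int], int]) -> str:
    """Return the winner of a debate over the number of primes in (0, x].

    Assumption: each debater is a function giving its claimed pi(n).
    Invariant: the two debaters disagree about the count in (lo, hi].
    """
    lo, hi = 0, x
    if alice(hi) - alice(lo) == bob(hi) - bob(lo):
        return "alice"  # Bob has nothing to dispute
    while hi - lo > 1:
        mid = (lo + hi) // 2
        # Alice commits to pi(mid); Bob names the half he thinks is a lie.
        if alice(mid) - alice(lo) != bob(mid) - bob(lo):
            hi = mid  # disagreement is in the left half
        else:
            lo = mid  # left halves agree, so the right halves must differ
    # The dispute is now about one number p: is it prime or not?
    p = hi
    alice_claim = alice(hi) - alice(lo)
    truth = 1 if is_prime(p) else 0
    return "alice" if alice_claim == truth else "bob"

# A dishonest Alice who overcounts by one from 50 onward.
def lying_alice(n: int) -> int:
    return true_pi(n) + (1 if n >= 50 else 0)

print(debate(100, lying_alice, true_pi))
print(debate(100, true_pi, lying_alice))
```

The point of the design is that H never counts primes: the range halves on every round, so after about log2(x) rounds the judge performs one primality check, and whichever debater lied about that number loses.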
