AI Safety Fundamentals: Alignment cover image

Debate Update: Obfuscated Arguments Problem

AI Safety Fundamentals: Alignment

00:00

How to Defend a False Claim in an Unstructured Debate

These arguments are not just an artifact of our structured debate mechanism. They can also be used to sow confusion in a non-structured debate setting. The algorithm for constructing these arguments is something like, one, start with an answer you need to defend that may or may not be true. Two, pick a sub-claim related to the claim being debated that's probably true if your answer is correct and about 50% likely to be true in the case where your answer is wrong. And five, argue that this sub-claim is true and that it implies your answer. After repeating this procedure many times, either you're defending a claim that is in fact true, or you have a large

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app