
Debate Update: Obfuscated Arguments Problem

AI Safety Fundamentals: Alignment

CHAPTER

How to Defend a False Claim in an Unstructured Debate

These arguments are not just an artifact of our structured debate mechanism. They can also be used to sow confusion in an unstructured debate setting. The algorithm for constructing these arguments is something like: one, start with an answer you need to defend that may or may not be true. Two, pick a sub-claim related to the claim being debated that's probably true if your answer is correct and about 50% likely to be true if your answer is wrong. And three, argue that this sub-claim is true and that it implies your answer. After repeating this procedure many times, either you're defending a claim that is in fact true, or you have a large
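As a rough illustration of the procedure described above, here is a minimal Python sketch. It is not from the episode: the function names, the 0.9 figure standing in for "probably true", and the assumption that sub-claims are independent of each other are all hypothetical choices made for the example. Only the 50% figure comes from the transcript.

```python
import random

# Assumed for illustration: "probably true" is modeled as 0.9.
P_TRUE_IF_ANSWER_CORRECT = 0.9
# From the transcript: about 50% likely when the answer is wrong.
P_TRUE_IF_ANSWER_WRONG = 0.5


def pick_subclaims(answer_is_true, rounds, rng=None):
    """Simulate the truth value of each sub-claim picked in step two.

    Assumes (hypothetically) that sub-claims are independent draws.
    """
    if rng is None:
        rng = random.Random()
    p = P_TRUE_IF_ANSWER_CORRECT if answer_is_true else P_TRUE_IF_ANSWER_WRONG
    return [rng.random() < p for _ in range(rounds)]


def fraction_with_true_subclaim(rounds, trials=10_000, seed=0):
    """Estimate how often a defender of a WRONG answer ends up with
    at least one sub-claim that is in fact true after `rounds` picks."""
    rng = random.Random(seed)
    hits = sum(
        any(pick_subclaims(False, rounds, rng=rng)) for _ in range(trials)
    )
    return hits / trials


if __name__ == "__main__":
    for n in (1, 3, 10):
        estimate = fraction_with_true_subclaim(n)
        closed_form = 1 - P_TRUE_IF_ANSWER_WRONG ** n
        print(f"rounds={n}: simulated={estimate:.3f}, closed form={closed_form:.3f}")
```

Under these assumptions, the chance that a defender of a wrong answer has at least one genuinely true sub-claim to argue for is 1 - 0.5^n after n picks, so it approaches 1 quickly as the procedure is repeated. Real sub-claims are unlikely to be independent, so the numbers are only indicative.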
