AXRP - the AI X-risk Research Podcast cover image

20 - 'Reform' AI Alignment with Scott Aaronson

AXRP - the AI X-risk Research Podcast

00:00

How to Calculate a Score for a Token

The score is just the sum, you know, over all the N grams of like the log of one over one minus r sub i. The score will be systematically larger in watermark than non-watermark text. How many tokens do we need in order to separate these two normal distributions from from each other? Yeah. And now that this, as it turned out, will, will depend on another parameter, which is the average entropy per token as perceived by GBT itself. Okay. Gotcha.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app