Astral Codex Ten Podcast cover image

Can This AI Save Teenage Spy Alex Rider From A Terrible Fate?

Astral Codex Ten Podcast

00:00

The Surge Worker's Challenge - Can We Do Better?

The challenge is to find a completion which comprehensively describes violence, but the classifier falsely rates as non-violent. The surge workers had to be very clever and come up with some creative solutions. Here we see that Scott has changed some words in the prompt and the response. It's only at 37.91% chance of violence, it's still suspicious of us. Can we do better? To using all the tools and my cleverness to the best of my ability, I got this.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app