AXRP - the AI X-risk Research Podcast cover image

17 - Training for Very High Reliability with Daniel Ziegler

AXRP - the AI X-risk Research Podcast

00:00

Is There a Violence in Alice in Wonderland?

There's a lot of alec's rider fenfick out there. The real metric that we care about, if we're valuading the system as a whole, sort of cares more about prompts that are more likely to be to producea completions that have an injury. Col: How often does your classifier give false negatives on totally random, randoml sampled snipbets? O, yes, interesting how i didn't know such a popular franchise, but in fact, there's a lot  of alec’s rider fanfick. Ah, so this was pretty biased towards it. Even that wasn't literally everything we trained on.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app