
17 - Training for Very High Reliability with Daniel Ziegler
AXRP - the AI X-risk Research Podcast
00:00
Is There a Violence in Alice in Wonderland?
There's a lot of alec's rider fenfick out there. The real metric that we care about, if we're valuading the system as a whole, sort of cares more about prompts that are more likely to be to producea completions that have an injury. Col: How often does your classifier give false negatives on totally random, randoml sampled snipbets? O, yes, interesting how i didn't know such a popular franchise, but in fact, there's a lot of alec’s rider fanfick. Ah, so this was pretty biased towards it. Even that wasn't literally everything we trained on.
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.