
16 - Preparing for Debate AI with Geoffrey Irving
AXRP - the AI X-risk Research Podcast
How to Find Failures in a Language Model?
The main thing we learn is that it is not too r to find failures in this way. The question will be cand of how that carries forward once you do. And i think there's a because that if the space, the generally prompting of these models works quite well and is quite flexible, you can find quite a lot of modes of failure using this kind of approach.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.