AXRP - the AI X-risk Research Podcast cover image

16 - Preparing for Debate AI with Geoffrey Irving

AXRP - the AI X-risk Research Podcast

CHAPTER

How to Find Failures in a Language Model?

The main thing we learn is that it is not too r to find failures in this way. The question will be cand of how that carries forward once you do. And i think there's a because that if the space, the generally prompting of these models works quite well and is quite flexible, you can find quite a lot of modes of failure using this kind of approach.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner