AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Limits of Guardrails for Generative Errors
I think it's very important to be thinking about the ways these systems could either be incorrect or biased or toxic. So in our case, we do a lot of generative adversarial testing of these systems. In fact, when you use BOD, for example, the output that you get when you type in a prompt is not necessarily the first thing that BOD came up with,. We're running 15, 16 different of the same prompt to look at those outputs and pre-assess them for safety.