The Limits of Guardrails for Generative Errors

I think it's very important to be thinking about the ways these systems could either be incorrect or biased or toxic. So in our case, we do a lot of generative adversarial testing of these systems. In fact, when you use BOD, for example, the output that you get when you type in a prompt is not necessarily the first thing that BOD came up with,. We're running 15, 16 different of the same prompt to look at those outputs and pre-assess them for safety.

Play episode from 14:32

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app