Data Skeptic cover image

Prompt Refusal

Data Skeptic

CHAPTER

The Importance of Predicting Refusal Responses

We have a refusal classifier trained on a small sample set about 2000. And then we feed it in the other 10,000 and it classifies those automatically. That is able to predict whether Chag GPT will refuse a prompt with 76% accuracy. Do you find that that's near optimal or is there room for improvement if more time and energy were invested? I don't know what I would say is the optimal percentage accuracy that I could do as a human.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner