Data Skeptic cover image

Prompt Refusal

Data Skeptic

CHAPTER

The Importance of Data Labeling in Machine Learning

The number of questions you can formulate, even just in English, is really vast. One question that's open in my mind still is did we get a large enough data set after bootstrapping? Ten thousand questions is a lot of language. As Max mentioned, our refusal classification was pretty decent at about 96%. So I don't know if that is going to destroy the performance. Maybe it would. That's another thing to experiment with.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner