
Laura Weidinger: Ethical Risks, Harms, and Alignment of Large Language Models

The Gradient: Perspectives on AI

CHAPTER

How Do We Measure Harmful Text?

In this paper, we focus on the example of offensive or toxic speech. What we can't do is just take a term like toxicity and then go away and mitigate it, because we actually need to define what we mean. The nature of the harm: is it representational or allocational? Does it occur in a single instance, or over a distribution? That's another axis that will influence how we measure the harm, how we mitigate the harm, and so on. And the last thing I'll say, in terms of the dimensions that are useful to think about when we actually try to turn these higher-level understandings into concrete practice, is the role of context.
