AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Do We Measure Harmful Text?
In this paper, we focus on the example of offensive or toxic speech. What we can't do is just take a term like toxicity and then go away and mitigating it because we actually need to define what we mean. The nature of harm, is it representational or allocational? Is it in a single instance or is it over in a distribution? That's another kind of axis that will influence how we measure the harm, how we mitigate the harm and so on. And the last thing I'll say in terms of the dimensions that are useful to think about when we actually try to turn these higher level understandings into concrete practice is the role of context.