AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Are Bias in the Datasets Top of Mind and Easy to Root Out Than Biases in the Algorithmic Process?
The training of models has become very, very computationally intensive and very expensive. And we are now training very, very large models. So there is a complexity and opaqueness to that process that's in my mind perhaps greater than the mysteries of the dataset that goes into that process. But I feel like the problem of assessing whether a dataset has bias is at this point, maybe a more straightforward problem than thinking about whether your training process might inadvertently lead to biases.