AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Chat GPT - What's in the Training Data?
The first and very fundamental problem with chat GPT is that we don't know what its training data is. I was involved in something called data statements which were inspired by like the component specifications that you get with electric components right if you're building something physical and related to that make Mitchell do model cards. There's a whole bunch of these that are out there and what's shared across them is this idea that if you don't knows what's in the training data then you are not positioned to decide if you can safely deploy the thing. So that's sort of a big red flag to me that we don’t know what's in this training data.