The Radical AI Podcast cover image

The Limitations of ChatGPT with Emily M. Bender and Casey Fiesler

The Radical AI Podcast

00:00

Chat GPT - What's in the Training Data?

The first and very fundamental problem with chat GPT is that we don't know what its training data is. I was involved in something called data statements which were inspired by like the component specifications that you get with electric components right if you're building something physical and related to that make Mitchell do model cards. There's a whole bunch of these that are out there and what's shared across them is this idea that if you don't knows what's in the training data then you are not positioned to decide if you can safely deploy the thing. So that's sort of a big red flag to me that we don’t know what's in this training data.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app