
The Limitations of ChatGPT with Emily M. Bender and Casey Fiesler

The Radical AI Podcast

CHAPTER

ChatGPT - What's in the Training Data?

The first and very fundamental problem with ChatGPT is that we don't know what its training data is. I was involved in something called data statements, which were inspired by, like, the component specifications that you get with electric components, right, if you're building something physical. And related to that, Meg Mitchell et al.'s model cards. There's a whole bunch of these that are out there, and what's shared across them is this idea that if you don't know what's in the training data, then you are not positioned to decide if you can safely deploy the thing. So that's sort of a big red flag to me, that we don't know what's in this training data.

