The training data can be quite skewed toward particular types of data or documents that have been taken off the internet. That kind of feeds into the types of responses that you're going to get from these chatbot programs. There's also a common list of about 400 words that are often used to kind of, you know, take certain things out of that so that you won't get responses that are assumed to be something that aren't appropriate for general public interaction.
Paris Marx is joined by Emily M. Bender to discuss what it means to say that ChatGPT is a “stochastic parrot,” why Elon Musk is calling to pause AI development, and how the tech industry uses language to trick us into buying its narratives about technology.
Emily M. Bender is a professor in the Department of Linguistics at the University of Washington and the Faculty Director of the Computational Linguistics Master’s Program. She’s also the director of the Computational Linguistics Laboratory. Follow Emily on Twitter at @emilymbender or on Mastodon at @emilymbender@dair-community.social.
Tech Won’t Save Us offers a critical perspective on tech, its worldview, and wider society with the goal of inspiring people to demand better tech and a better world. Follow the podcast (@techwontsaveus) and host Paris Marx (@parismarx) on Twitter, and support the show on Patreon.
The podcast is produced by Eric Wickham and part of the Harbinger Media Network.
Also mentioned in this episode:
Support the show