There is some attempt at filtering the data, because even the people who say this is just a representative sample would rather not have their dataset clogged with random text. So let's try to get those out. And when you use that list of words to filter out websites, you are gonna get rid of some porn and you are gonnaget rid of some hate websites. That's good, but it's not thorough for one thing. There's a whole bunch of words in there that actually have to do with gender identities and sexual identities which can show up on sites where people are positively speaking about these identities. If this is the future of search engines or whatnot as they're trying to pitch
Paris Marx is joined by Emily M. Bender to discuss what it means to say that ChatGPT is a “stochastic parrot,” why Elon Musk is calling to pause AI development, and how the tech industry uses language to trick us into buying its narratives about technology.
Emily M. Bender is a professor in the Department of Linguistics at the University of Washington and the Faculty Director of the Computational Linguistics Master’s Program. She’s also the director of the Computational Linguistics Laboratory. Follow Emily on Twitter at @emilymbender or on Mastodon at @emilymbender@dair-community.social.
Tech Won’t Save Us offers a critical perspective on tech, its worldview, and wider society with the goal of inspiring people to demand better tech and a better world. Follow the podcast (@techwontsaveus) and host Paris Marx (@parismarx) on Twitter, and support the show on Patreon.
The podcast is produced by Eric Wickham and part of the Harbinger Media Network.
Also mentioned in this episode:
Support the show