There is some attempt at filtering the data, because even the people who say this is just a representative sample would rather not have their dataset clogged with random text. So let's try to get those out. And when you use that list of words to filter out websites, you are gonna get rid of some porn and you are gonnaget rid of some hate websites. That's good, but it's not thorough for one thing. There's a whole bunch of words in there that actually have to do with gender identities and sexual identities which can show up on sites where people are positively speaking about these identities. If this is the future of search engines or whatnot as they're trying to pitch

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode