Reddit is an incredible trove of natural language data. Every day, 57 million people go to Reddit to engage in conversations on nearly every topic you can think of. That has made it a rich source of data for AI training. Companies like Google, OpenAI, and Microsoft have all used Reddit conversations in the development of their foundation models.
The AI Data Wars come to Twitter as Elon Musk rate-limits users in an attempt to block AI data scraping. The move follows big changes to the Reddit API that some have called the end of the internet as we know it. Before that on The Brief: Valve has said it won't approve games for Steam that use AI art that might have copyright issues; Humane shares more information about its Ai Pin wearable; and AI enthusiasm in markets is causing some people to worry.
Today's Sponsor:
Supermanage - AI for 1-on-1's - https://supermanage.ai/breakdown
ABOUT THE AI BREAKDOWN
The AI Breakdown helps you understand the most important news and discussions in AI.
Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe
Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown
Join the community: bit.ly/aibreakdown
Learn more: http://breakdown.network/