
The Surprising Websites Included in ChatGPT's Training Data
AI Chat: ChatGPT, AI News, Artificial Intelligence, OpenAI, Machine Learning
The Importance of Personal Blogs in AI Models
30% of all the content that's used for so many different AI models is no longer available online. Google has this massive, what I call a black hole of data, because 30% of all their data is exclusive to them. Reddit recently said they're going to start charging companies to train models on Reddit content. The company typically uses a high-quality data set to fine-tune the models.


