How to Use Discord to Enhance Midjourney?
EleutherAI is kind of the only organization out there that is collecting data that anyone can use to train anything. They have two large collections of data called The Stack and The Pile. The largest source of untranscribed text is essentially YouTube. There's a prevailing theory that the primary purpose of Whisper is to transcribe all video to get text to train the models, because we are so limited on data. We've basically exhausted it: we're data constrained in terms of our ability to improve our models. Now let's go get all the data that's possible.