The Importance of Data Accumulation Techniques
LLMs, large language models, were famously trained by first scouring the internet. Do you do something similar with recorded voices as you're getting your models to an initial state of training? Like, did you go out and inhale gazillions of hours of audio when you were initially training your models?

So we have a slightly different approach to collecting data. Darrin: We found some interesting techniques to not totally scour the internet for data, and to do it in a way that's more ethical.