6min chapter

Practical AI: Machine Learning, Data Science, LLM cover image

AI trends: a Latent Space crossover

Practical AI: Machine Learning, Data Science, LLM

CHAPTER

How to Mix Up Public Data Sets to Make Your Model Better

I was thinking about unable datasets for unsupervised learning or self supervised learning, right? Like that is something that we are trying to grab our heads around like common crawl stack overflow archive the books. And as far as I can tell, nobody has a street answer as to how what the data mix is and everyone's just kind of experiments. Yeah, I get the sense that open AI doesn't want to encourage that anymore. They don't have fine tuning for 3.5 and 4. But each of those had a unique sort of flavor of this data under the hood that might actually work quite well for your use case. So one example that I've used recently in some work is the

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode