Ep 7: Co-Creator of Databricks Dolly Mike Conover on Open-Source LLMs

Unsupervised Learning

CHAPTER

The Importance of Labeled Data in ChatGPT

Dolly v1 can write letters in a well-formatted template. It understands salutations. Dolly v2 does not: it can write letter-like prose, but it doesn't understand what the beginning of a letter looks like. And so I think that is because our annotation rubric was, like I said, a Google Form. You know, in part, you generally pay annotators to do this work. But we have started to... there's some function that is not well understood, or at least the people that understand it are not talking about it. We're just nudging the flywheel rather than requiring humans to think up everything that the model knows.

