Is LHF the Best Way to Fine Tune Language Models?

I don't want to make the claim that our LHF is like definitely the way to go. You can certainly use the same data that we're collecting and fine to using a different algorithm. So I think there's actually quite a bit of interesting research to do in terms of how you get similar benefits with less compute. But yeah I think I think that's kind of an open question. And then Jason if you want to ask another question then maybe maybe I'll open up the floor also to the authors to finish off.

Play episode from 35:33

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app