Are There Other Major Benefits of LHF?

The 1 billion parameter model that was trained with our LHF was performed on human evaluations roughly similarly or something like that, a little bit better than the 175 billion kind of vanilla GPT three. We often decompose that into some sub dimensions like helpfulness, harmfulness and honesty. And in fact, at least in this first paper, doesn't, most of the benefits come from improvements in helpfulness and honesty rather than harmlessness. Ryan, you want to talk about distillation a bit?

Play episode from 20:35

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app