AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Negative Metrics of Chat TPT
The reason why chat TPT was initially so verbose was because the evaluators just they liked verbose output. That's a whole like you said, different conversation on its own. Like this whole area of RLHF reinforcement learning with human feedback. Yeah. As all these things to right take care of because make sure that the human feedback is diverse enough and not just the verbose versus non verbose person. So yeah, that's a very interesting conversation also.