The Llama License and the Reward Modeling of the Chat-Based Model
The Llama license says you will not use the model weights, et cetera, to improve any other large language model. That restricts the sort of thing you're allowed to do with them to that one family of models. They actually use two separate reward models in the fine-tuning of the chat-based model: one related to helpfulness, and the other related to safety.
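To make the two-reward-model idea concrete, here is a minimal sketch of how a helpfulness score and a safety score might be folded into a single reward during fine-tuning. The episode does not say how the two are combined, so the gating rule, the `safety_threshold` value, and the names `RewardScores` and `combined_reward` are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class RewardScores:
    """Scores from two hypothetical reward models, each assumed to lie in [0, 1]."""
    helpfulness: float
    safety: float


def combined_reward(
    scores: RewardScores,
    safety_threshold: float = 0.15,  # assumed cutoff, chosen for illustration only
    is_safety_prompt: bool = False,  # assumed flag marking prompts tagged as safety-relevant
) -> float:
    """Return one scalar reward from the two scores.

    Assumed rule: if the prompt is safety-relevant, or the safety model
    scores the response below the threshold, the safety score is used;
    otherwise the helpfulness score is used.
    """
    if is_safety_prompt or scores.safety < safety_threshold:
        return scores.safety
    return scores.helpfulness


if __name__ == "__main__":
    print(combined_reward(RewardScores(helpfulness=0.9, safety=0.8)))
    print(combined_reward(RewardScores(helpfulness=0.9, safety=0.05)))
```

Under this assumed rule, a response that looks helpful but scores poorly on safety receives the low safety score as its reward, so the safety model can override the helpfulness model when it matters.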