AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Do You Define Reinforcement Learning?
Lambda is an AI that tries to move towards the positive things and away from the negative things. The longer a conversation goes, the stronger that penalty is going to get. It's not perfect, but it's something. One of the guardrails that Lambda had was that it was supposed to be helping users with the task that they needed. That helps keep it on topic. And part of the reason why chat TBT succeeded where Galactica didn't, is Galacticadidn't really have those guardrails at all.