AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Set the Impact Penalty Weight?
So, one question i have about this kind of measure. Basically, the way, you, the way gets implemented, is your reward function is whatever your reward function would otherwise be,. And there's this meter of am, how how strong the regularization is, and how it trades off. How do i know how to set that parameter? Is there some like guide er? Is it just a guess and check? So think the kind of te falt answer to this er is to kind of start out with a high value of the regularization perometer...and then kind of keep reducing it until the agent starts doing something.