AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Cost Function of a Controller Is Quadratic.
So how is it that not knowing the model and kind of learning it, but not very well, gives us good performance? That's a very, very good question. No caso is like kind ofi from the paadime wehave been using in classical reinforcemetlering, algerdam. Use this controller that is going to do well on this environment the real world. And if it's not performing, will make it better,. despite the fact you don't know the real world i gas.