AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How Do You Train the AI?
In our Chocolate Bar application, the feedback channel was very easy to describe. The goal is to place the product on the outlet conveyor belt within a certain position so that the plastic bag packaging machine works in fixed time intervals. And in this way, placing the chocolate bar at the correct position on the last conveyor belt yields the highest possible reward to our reinforcement learning algorithm. In turn, the more distant the chocolate bar was placed, the lower our reward signal becomes.