The approach of feeding state derivatives back into the learning ... In their study, it was found that the optimal decision ...