Jha, Sumit Kumar, Bhasin, Shubhendu. (2014). On-Policy Q-Learning for Adaptive Optimal Control
.
1-6. 10.1109/adprl.2014.7010649
Jha, Sumit Kumar, Bhasin, Shubhendu. (2014). On-Policy Q-Learning for Adaptive Optimal Control
. 1-6. 10.1109/adprl.2014.7010649