References
Bertsekas, Dimitri P. 2012. Dynamic Programming and Optimal Control, Vol. II, 4th Edition. Athena Scientific.
———. 2017. Dynamic Programming and Optimal Control, Vol. I. Athena Scientific.
Bertsekas, D., and J. Tsitsiklis. 1996. Neuro-Dynamic Programming. Athena Scientific.
Sutton, R., and A. Barto. 1998. Reinforcement Learning: An Introduction. MIT Press.