Quantitative Analysis of Lipschitz Continuous Optimal Control Problems and Its Application to Reinforcement Learning
The authors rigorously analyze the stability and convergence properties of the value function QL associated with Lipschitz continuous optimal control problems, and use these results to propose a new reinforcement learning algorithm based on the Hamilton-Jacobi-Bellman (HJB) equation.