Core Concepts
The proposed SOC-MartNet is an efficient numerical method for solving high-dimensional Hamilton-Jacobi-Bellman (HJB) equations and stochastic optimal control problems without the need for explicit minimization of the Hamiltonian.
Abstract
The paper proposes a novel numerical method called SOC-MartNet for solving high-dimensional Hamilton-Jacobi-Bellman (HJB) equations and stochastic optimal control problems. The key ideas are:
Reformulate the HJB equation into a stochastic neural network learning process, where a control network and a value network are trained such that the associated Hamiltonian process is minimized and the cost process becomes a martingale.
Employ an adversarial network to enforce the martingale property of the cost process, based on the projection property of conditional expectations. This avoids the need for explicit minimization of the Hamiltonian over the control space at each time-space point.
The training algorithm enjoys parallel efficiency, as it is free of time-direction iterations during gradient computation, in contrast to existing deep learning methods for PDEs.
The proposed SOC-MartNet is shown to be effective and efficient for solving HJB-type equations and stochastic optimal control problems with dimensions up to 500.
Stats
The paper does not provide specific numerical data or statistics. The focus is on the methodology and algorithm development.
Quotes
The paper does not contain any direct quotes that are important to highlight.