Core Concepts
The paper proposes a novel technique called "probabilistic invariance" to efficiently design myopic controllers that can ensure long-term safety in stochastic control systems. This technique allows the controller to directly optimize for the probability of long-term safety or convergence, rather than just the short-term state constraints.
Abstract
The paper addresses the challenge of ensuring long-term safety in stochastic control systems, where traditional set-invariance based methods may not be sufficient due to the accumulation of uncertainties over time. To overcome this, the authors introduce a novel "probabilistic invariance" technique that characterizes the invariance conditions of the probability of interest, rather than the state space.
The key insights are:
An infinitesimal future value of a long-term probability provides explicit computable information about invariant sets in the probability space, unlike the state space.
This allows the authors to derive myopic conditions that can ensure long-term probabilistic safety or convergence.
The paper then integrates this technique into safe control and learning methods:
For control, the proposed technique can equip nominal controllers (e.g., neural networks) with long-term safety guarantees using either an additive modification or a constrained optimization approach.
For learning, the technique can be used to ensure long-term safety of the control policies during and after training.
The performance of the proposed techniques is demonstrated through numerical simulations.
Quotes
"The key insight is that its probabilistic analog, namely an infinitesimal future value of a long-term probability, does provide explicit computable information about invariant sets in the space of probability values."
"Condition (39) is a forward invariance condition on probability, while typical control barrier function (CBF) based methods perform forward invariance on state space."