Core Concepts
A data-driven method for identifying a maximal set of control strategies that guarantee an unknown stochastic system remains within a safe set with probabilistic safety assurances.
Abstract
The paper introduces a framework for computing a maximal set of permissible control strategies for stochastic systems with unknown dynamics. The key steps are:
Learning the system dynamics using Gaussian process (GP) regression and obtaining probabilistic error bounds.
Developing an algorithm that constructs piecewise stochastic barrier functions to find a maximal permissible strategy set using the learned GP model. This involves sequentially pruning the worst controls until a maximal set is identified.
The permissible strategies are guaranteed to maintain probabilistic safety for the true system, which is important for learning-enabled systems to enable safe data collection and complex behaviors.
Case studies on both linear and nonlinear systems demonstrate that increasing the dataset size for learning the system grows the permissible strategy set.
Stats
The paper does not provide specific numerical data, but it presents the following key figures:
The probability of the system remaining in the safe set for N time steps is lower bounded by 1 - (η + βN), where η and β are parameters derived from the piecewise stochastic barrier function synthesis.
For the linear system case, the permissible strategy set encompasses 93.6% of the total possible actions for the known system, 88.8% for the system learned with 500 data points, and 91.2% for the system learned with 2000 data points.
For the nonlinear system case, the permissible strategy set maintains 39.5%, 40.1%, and 49.5% of all available controls for datasets of 500, 1000, and 1500 data points, respectively.
Quotes
There are no direct quotes from the content.