Core Concepts
This paper presents a novel data-driven approach to simultaneously achieve desirable control and system identification objectives in linear switching systems. The proposed algorithm leverages recent advances in non-asymptotic analysis of linear least-square methods to efficiently identify the unknown system parameters within a finite number of steps.
Abstract
The paper focuses on the problem of linear system identification in the setting where the underlying partially-observed linear dynamical system lies within a finite collection of known candidate models. The authors first consider the problem of identification from a given trajectory, which reduces to identifying the index of the true model with high probability. They characterize the finite-time sample complexity of this problem by leveraging recent advances in the non-asymptotic analysis of linear least-square methods.
Next, the authors consider the switching control of linear systems, where there is a candidate controller for each of the candidate models and data is collected through interaction of the system with a collection of potentially destabilizing controllers. They develop a dimension-dependent criterion that can detect those destabilizing controllers in finite time. By leveraging these results, they propose a data-driven switching strategy that identifies the unknown parameters of the underlying system and provide a non-asymptotic analysis of its performance.
The key contributions are:
A least-square-based method for linear model identification with prior knowledge that the ground truth is contained in a finite collection of candidate models, yielding a dimension-free sample complexity bound.
An instability detection criterion that quantitatively bounds the finite-time input-to-output gain of a stable linear system.
A data-driven algorithm for linear system identification in switching control with non-asymptotic guarantees.
Implications of the non-asymptotic guarantees on the classical method of estimator-based supervisory control.
Stats
The paper does not contain any explicit numerical data or statistics. The technical results are presented in the form of theoretical bounds and guarantees.
Quotes
"We develop a dimension-dependent criterion that can detect those destabilizing controllers in finite time."
"By leveraging these results, we propose a data-driven switching strategy that identifies the unknown parameters of the underlying system and provide a non-asymptotic analysis of its performance."