Concepts de base
The perturbed gradient flow for the linear quadratic regulator (LQR) problem is shown to be small-disturbance input-to-state stable (ISS) under suitable conditions on the objective function.
Résumé
The paper studies the effect of perturbations on the gradient flow of a general nonlinear programming problem, where the perturbation may arise from inaccurate gradient estimation in the setting of data-driven optimization.
Key highlights:
- The authors introduce the concept of "small-disturbance input-to-state stability (ISS)" and provide a necessary and sufficient condition for a system to be small-disturbance ISS.
- Under assumptions of coercivity and a "comparison just saturated" (CJS) Polyak-Lojasiewicz (PL) condition on the objective function, the perturbed gradient flow is shown to be small-disturbance ISS.
- For the linear quadratic regulator (LQR) problem, the authors establish the CJS-PL property for the objective function, and consequently prove that the standard gradient flow, natural gradient flow, and Newton gradient flow are all small-disturbance ISS.
- The new contribution is the establishment of the CJS-PL property for the LQR problem, which extends previous results that only showed a semi-global estimate.
Stats
The following sentences contain key metrics or important figures used to support the author's key logics:
Tr (PK - P*) ≥ α4(||K - K*||_F)
Tr (M_K) ≥ a||K - K*||^2_F + a'Tr (P_K - P*)
Citations
"Since the objective function J_2 of the LQR problem depends on the initial condition, we are motivated to study an equivalent optimization problem (min_K∈G J_2(K)), which is independent of the initial condition."
"For any stabilizing controller u(t) = -Kx(t), where K ∈ G, and any nonzero initial state x_0 ∈ R^n, the corresponding cost is J_1(x_0, K) = x_0^T P_K x_0, where P_K = P_K^T is the unique positive definite solution of the following Lyapunov equation (A - BK)^T P_K + P_K(A - BK) + Q + K^T RK = 0."