The paper introduces the Distributed uncertainty Distributionally robust LQG (D2-LQG) control problem, where the distributional uncertainty is modeled using relative entropy constraints at each time step. This is in contrast to previous work that considered a single constraint on the overall noise distribution.
The authors first analyze the worst-case performance problem in the absence of control input. They show that the solution takes the form of a risk-sensitive cost with a time-varying risk-sensitive parameter. This provides valuable insight into the nature of the worst-case model.
The authors then derive the solution to the D2-LQG control problem using dynamic programming and Lagrange duality. The optimal control policy is shown to have a structure similar to the standard linear quadratic regulator, but with a time-varying risk-sensitive parameter. A coordinate descent algorithm is proposed to numerically compute the optimal control gains.
Finally, the authors provide an explicit characterization of the worst-case noise distribution that the D2-LQG controller must counteract.
Na inny język
z treści źródłowej
arxiv.org
Głębsze pytania