Основні поняття
A game-based distributed decision approach is proposed to solve the multi-agent optimal coverage problem, where the equivalence between the equilibrium of the game and the extreme value of the global performance objective is proven. A distributed algorithm is developed to obtain the global near-optimal coverage using only local information, and its convergence is analyzed and proved. The proposed method is applied to maximize the covering time of a satellite constellation for a target.
Анотація
This paper focuses on the optimal coverage problem (OCP) for multi-agent systems with decentralized optimization. The authors propose a game-based distributed decision approach for the multi-agent OCP and prove the equivalence between the equilibrium of the game and the extreme value of the global performance objective.
The key highlights are:
-
A game model is formulated where each agent aims to maximize its local performance objective, which is designed to be equivalent to the global performance objective. This enables distributed decision-making.
-
A distributed algorithm is developed to find the optimal solution of the OCP, and its convergence is strictly analyzed and proved. The mechanism of ε-innovator is proposed to improve the global performance by only allowing ε-innovators to update policies in each iteration.
-
The proposed method is applied to a satellite constellation reconfiguration problem, where satellites try to maximize the total visible time window for an observation target while saving energy. Simulation results show the proposed method can significantly improve the solving speed of the OCP compared to the traditional centralized method.
Статистика
The simulation uses the following key parameters:
Convergence accuracy, ε: 0.1
Total number of iterations, P: 20
Scale coefficient, γ: 0.2
Simulation duration: 24 hours
Discrete interval: 5 seconds
Semi-major axis of the orbit, a: 6896.27 km
Orbital inclination, i: 98°
Initial RANN, Ω(t0): 284.507°
Initial Greenwich sidereal hour angle, G0: 284.507°
Longitude and latitude of the target: (121.3°, 31.1°)
Geocentric angle of satellite observation range, ρ̄: 9.45°
Strategy space of k-th satellite, Θk: [-15°, 15°]
Initial phase of sk, Mk(t0): (k-1) * 15°
Initial energy surplus coefficient of sk, θk,max: 1
Цитати
"The computing time of DOCS is much less than that of the centralized method when they use the same solver, and the values of global performance obtained by different methods are close."
"DOCS fminbnd always has shorter computing times than DOCS pattern, and the values of global performance objective obtained by DOCS fminbnd are always greater than that by DOCS pattern."