Convergence Analysis of Entropy-Regularized Independent Natural Policy Gradient in Multi-Agent Games
Under sufficient entropy regularization, the independent natural policy gradient dynamics in multi-agent games converge linearly to the quantal response equilibrium.