This paper introduces System Neural Diversity (SND), a novel metric to measure behavioral heterogeneity in multi-agent reinforcement learning (MARL) systems.
The key highlights are:
The authors first define a pairwise inter-agent behavioral distance using the Wasserstein metric, which captures the distance between the stochastic action distributions of the agents. They then aggregate these pairwise distances into the system-level SND metric.
The paper compares SND to the state-of-the-art Hierarchic Social Entropy (HSE) metric, showing that SND has desirable properties that HSE lacks. Specifically, SND is invariant to the number of equidistant agents and provides a measure of behavioral redundancy, which HSE does not capture.
Experiments on static tasks, such as a multi-agent goal navigation problem, demonstrate that heterogeneous policies can outperform homogeneous ones when the task requires specialized behaviors. In dynamic tasks, where the environment undergoes repeated disturbances, the authors show that SND can reveal latent resilience skills acquired by the agents, while other proxies like task performance fail to do so.
Finally, the paper shows how SND can be used to control diversity, allowing the enforcement of a desired heterogeneity set-point or range. This paradigm can be used to bootstrap the exploration phase, finding optimal policies faster and enabling novel and more efficient MARL paradigms.
إلى لغة أخرى
من محتوى المصدر
arxiv.org
الرؤى الأساسية المستخلصة من
by Matteo Betti... في arxiv.org 09-11-2024
https://arxiv.org/pdf/2305.02128.pdfاستفسارات أعمق