The paper addresses the issue of redundant computation in multi-agent systems (MASs) where multiple agents redundantly perform the same or similar computation due to overlapping observations. To mitigate this problem, the authors propose a framework called locally centralized execution (LCE) and a method called locally centralized team transformer (LCTT).
LCE establishes a cooperative framework where agents are dynamically organized into leader and worker roles based on the directionality of instruction messages. The team transformer (T-Trans) architecture allows leaders to provide specific instructions to each worker, and the leadership shift mechanism enables agents to autonomously decide their roles as leaders or workers.
The proposed LCTT was implemented upon the QMIX algorithm and evaluated in a level-based foraging (LBF) problem. The experimental results demonstrate that LCTT effectively reduces redundant computation without decreasing reward levels and leads to faster learning convergence compared to baselines like QMIX and multi-agent incentive communication (MAIC).
The authors also introduce a metric called the redundant observation ratio to quantify the extent of redundant computations in multi-agent systems. This metric allows for a direct comparison of the computational expenses between different deep learning-based methods.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Yidong Bai,T... at arxiv.org 04-23-2024
https://arxiv.org/pdf/2404.13096.pdfDeeper Inquiries