A Bi-level Reinforcement Learning Framework for Efficient Multi-Robot Coordination with Local Observations
A novel bi-level optimization framework, Bi-CL, that leverages centralized training and decentralized execution to enhance the learning efficiency and scalability of multi-robot coordination tasks with local observations.