Equivariant Policies for Robust Zero-Shot Coordination in Decentralized Partially Observable Markov Decision Processes
Equivariant network architectures can effectively leverage environmental symmetry to improve zero-shot coordination between independently trained agents in decentralized partially observable Markov decision processes.