The paper addresses the challenges of offline learning in zero-sum games and proposes a novel approach, ELA, which estimates exploited levels to improve learning efficiency. It introduces a Partially-trainable-conditioned Variational Recurrent Neural Network (P-VRNN) for unsupervised strategy representation learning and demonstrates its effectiveness on several game examples.
Key Insights Distilled From
by Shiqi Lei, Ka... at arxiv.org 03-01-2024
https://arxiv.org/pdf/2402.18617.pdf