The paper discusses the challenges of offline learning in zero-sum games and proposes a novel approach, ELA, which estimates exploited levels and improves learning efficiency. It introduces a Partially-trainable-conditioned Variational Recurrent Neural Network (P-VRNN) for unsupervised strategy representation learning and demonstrates its effectiveness on several example games.
Key insights distilled from
by Shiqi Lei, Ka... at arxiv.org, 03-01-2024
https://arxiv.org/pdf/2402.18617.pdf