This paper discusses the challenges of offline learning in zero-sum games and proposes a novel approach, ELA, that estimates exploited levels to improve learning efficiency. It introduces a Partially-trainable-conditioned Variational Recurrent Neural Network (P-VRNN) for unsupervised strategy representation learning and demonstrates its effectiveness on several example games.
Key insights distilled from
by Shiqi Lei, Ka... at arxiv.org, 03-01-2024
https://arxiv.org/pdf/2402.18617.pdf