The content delves into the application of Tsallis entropy as a one-parameter extension of Shannon entropy for optimal control. It discusses how this approach can achieve high entropy while maintaining sparsity in control policies through numerical examples and theoretical derivations. The study formulates Tsallis entropy regularized optimal control problems, deriving Bellman equations and investigating linearly solvable Markov decision processes and linear quadratic regulators. The analysis showcases the utility of Tsallis entropy regularization in achieving a balance between exploration and sparsity in control laws.
Key points include:
เป็นภาษาอื่น
จากเนื้อหาต้นฉบับ
arxiv.org
ข้อมูลเชิงลึกที่สำคัญจาก
by Yota Hashizu... ที่ arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01805.pdfสอบถามเพิ่มเติม