The content delves into the application of Tsallis entropy as a one-parameter extension of Shannon entropy for optimal control. It discusses how this approach can achieve high entropy while maintaining sparsity in control policies through numerical examples and theoretical derivations. The study formulates Tsallis entropy regularized optimal control problems, deriving Bellman equations and investigating linearly solvable Markov decision processes and linear quadratic regulators. The analysis showcases the utility of Tsallis entropy regularization in achieving a balance between exploration and sparsity in control laws.
Key points include:
Ke Bahasa Lain
dari konten sumber
arxiv.org
Wawasan Utama Disaring Dari
by Yota Hashizu... pada arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01805.pdfPertanyaan yang Lebih Dalam