The content delves into the application of Tsallis entropy as a one-parameter extension of Shannon entropy for optimal control. It discusses how this approach can achieve high entropy while maintaining sparsity in control policies through numerical examples and theoretical derivations. The study formulates Tsallis entropy regularized optimal control problems, deriving Bellman equations and investigating linearly solvable Markov decision processes and linear quadratic regulators. The analysis showcases the utility of Tsallis entropy regularization in achieving a balance between exploration and sparsity in control laws.
Key points include:
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Yota Hashizu... alle arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01805.pdfDomande più approfondite