The content delves into the application of Tsallis entropy as a one-parameter extension of Shannon entropy for optimal control. It discusses how this approach can achieve high entropy while maintaining sparsity in control policies through numerical examples and theoretical derivations. The study formulates Tsallis entropy regularized optimal control problems, deriving Bellman equations and investigating linearly solvable Markov decision processes and linear quadratic regulators. The analysis showcases the utility of Tsallis entropy regularization in achieving a balance between exploration and sparsity in control laws.
Key points include:
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Yota Hashizu... klo arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01805.pdfSyvällisempiä Kysymyksiä