核心概念
Tsallis entropy regularization balances exploration and sparsity in optimal control.
統計
Shannon entropy regularization is widely adopted in optimal control.
Tsallis entropy is a one-parameter extension of Shannon entropy.
Tsallis entropy is used for the regularization of linearly solvable MDP and LQR.
Tsallis entropy regularization balances exploration and sparsity in control policies.
Tsallis entropy regularized optimal control problem (TROC) is formulated for discrete-time systems.
引用
"Tsallis entropy is a one-parameter extension of Shannon entropy."
"Tsallis entropy regularization balances exploration and sparsity in control policies."
"TROC formulation addresses limitations of Shannon entropy in sparse control policies."