Decomposing Q-values with an ensemble approach improves performance in high-dimensional discrete action spaces.
Value-decomposition in reinforcement learning improves performance in high-dimensional discrete action spaces.