Core Concepts
The greedy algorithm is applicable to a wider range of arm feature distributions than previously shown, including distributions with origin-asymmetric support, by considering mixture distributions and proposing new representational distribution classes.
Abstract
The paper considers the sparse linear contextual bandit problem, where the reward is affected by the inner product of the selected arm features and an unknown sparse parameter. Recent studies have developed sparsity-agnostic algorithms based on the greedy arm selection policy, but the analysis requires strong assumptions on the arm feature distribution to ensure sample diversity.
The authors show that the applicability of the greedy algorithm can be extended to a wider range of arm feature distributions in two ways:
Distributions having a mixture component of a greedy-applicable distribution are also greedy-applicable. This allows the greedy algorithm to be applied to a broader class of distributions.
The authors propose new representational distribution classes, related to Gaussian mixture, discrete, and radial distributions, that are greedy-applicable. These classes can describe distributions with origin-asymmetric support, which were not covered by previous assumptions.
The authors provide theoretical guarantees for the greedy policy under these new distribution classes and demonstrate the usefulness of their analysis by applying it to other cases, such as the thresholded lasso bandit, combinatorial setting, and non-sparse setting.