An Asymptotically Optimal Algorithm for Determining if a Point or Interval Belongs to the Convex Hull of Multi-Armed Bandit Means
This paper presents Thompson-CHM, a Thompson-sampling-based algorithm that determines whether a given point or interval lies within the convex hull of the means of a set of probability distributions, with a sample complexity that is asymptotically optimal for this problem.
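
To make the membership question concrete, the following minimal sketch checks it in the idealized case where the means are already known. It assumes scalar (one-dimensional) means, so that the convex hull reduces to the interval between the smallest and largest mean; that restriction and the function name `point_in_hull` are illustrative assumptions, not taken from the paper. This is not Thompson-CHM itself, which has to answer the same question from samples, without access to the true means.

```python
from typing import Sequence


def point_in_hull(means: Sequence[float], x: float) -> bool:
    """Return True if x lies in the convex hull of the given scalar means.

    Assumption (not from the paper): the means are real numbers, so their
    convex hull is the closed interval [min(means), max(means)].
    """
    if not means:
        raise ValueError("at least one mean is required")
    return min(means) <= x <= max(means)


# Hypothetical example values, chosen only for illustration.
arm_means = [0.2, 0.5, 0.9]
inside = point_in_hull(arm_means, 0.4)   # 0.4 lies between 0.2 and 0.9
outside = point_in_hull(arm_means, 1.0)  # 1.0 exceeds the largest mean
```

In the bandit setting the means are unknown, so any such check can only be applied to estimates, and the difficulty lies in deciding how many samples to draw from each distribution before committing to an answer.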