Optimal Online Learning of Decision Trees with Thompson Sampling
The core message of this article is to devise a new Monte Carlo Tree Search algorithm, called Thompson Sampling Decision Trees (TSDT), that can produce optimal Decision Trees in an online setting, and to provide strong convergence guarantees for this algorithm.