핵심 개념
Adapting the nearest neighbour rule to contextual bandits leads to an efficient algorithm with no assumptions about data generation.
통계
알고리즘은 다음을 수행합니다.
알고리즘은 다음을 계산합니다.
알고리즘은 다음을 유지합니다.
인용구
"Our algorithm handles the fully adversarial setting with no assumptions about the data-generation process."
"Our algorithm is extremely efficient with per-trial running time polylogarithmic in both the number of trials and actions."