The key insights are:
The MSR can often be significantly smaller than the K1 calibration error, despite their linear relationship in the worst-case. This allows us to bypass the Ω(T^0.528) lower bound for the K1 calibration error.
We establish a general lemma (Lemma 5.2) that attributes MSR to bucket-wise biases. This lemma plays a crucial role in our analysis.
We show that the guarantee |b̂qi - qi| ≤ O(1/√ni) can be approximately achieved in the online binary prediction setting, using a refinement of the result from Noarov et al. (2023).
Combining Lemma 5.2 with the bound on |b̂qi - qi|, we obtain the final O(√T log T) expected MSR guarantee.
Our algorithm works in the standard online binary prediction setting. In each round t, the algorithm makes a prediction pt ∈ [0, 1] and the adversary reveals the true state θt ∈ {0, 1}. Both pt and θt can depend on the past history, but they cannot depend on each other. This allows our algorithm to leverage the power of randomization.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Lunjia Hu,Yi... às arxiv.org 04-23-2024
https://arxiv.org/pdf/2404.13503.pdfPerguntas Mais Profundas