The content discusses the application of Thompson Sampling in stochastic linear contextual bandits with noisy contexts. It introduces a modified algorithm and analyzes Bayesian cumulative regret. The article covers decision-making under uncertainty, challenges of noisy contexts, related works, motivation, problem settings, and novel approaches. It provides comparisons with existing algorithms and empirical demonstrations.
לשפה אחרת
מתוכן המקור
arxiv.org
תובנות מפתח מזוקקות מ:
by Sharu Theres... ב- arxiv.org 03-26-2024
https://arxiv.org/pdf/2401.11565.pdfשאלות מעמיקות