The paper introduces RS-DisRL, a framework for risk-sensitive distributional reinforcement learning (RL) with static Lipschitz risk measures (LRM) and general function approximation. It covers both model-based and model-free approaches and provides theoretical guarantees for sample-efficient learning. The work addresses sample-complexity challenges and extends to value function approximation.
Key insights drawn from the paper by Yu Chen, Xian... on arxiv.org, 02-29-2024:
https://arxiv.org/pdf/2402.18159.pdf