The paper introduces RS-DisRL for risk-sensitive RL with static LRM and general function approximation. It covers model-based and model-free approaches, providing theoretical guarantees for efficient learning. The work addresses challenges in sample complexity and extends to value function approximation.
Til et andet sprog
fra kildeindhold
arxiv.org
Vigtigste indsigter udtrukket fra
by Yu Chen,Xian... kl. arxiv.org 02-29-2024
https://arxiv.org/pdf/2402.18159.pdfDybere Forespørgsler