The paper introduces Recall to Imagine (R2I), a method that integrates state space models (SSMs) into the world models of model-based reinforcement learning (MBRL) agents. The integration aims to improve temporal coherence, long-term memory, and credit assignment. Across a range of tasks, R2I sets a new state of the art on challenging memory and credit assignment RL tasks. It achieves superhuman performance in the complex Memory Maze domain while remaining comparable on classic RL benchmarks such as Atari and DMC. R2I is also faster than DreamerV3, the state-of-the-art MBRL method, so it converges in less time.
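For readers unfamiliar with state space models, here is a minimal sketch of a discrete linear SSM recurrence, the general kind of sequence model the summary refers to. It is not R2I's architecture or code: the function name `ssm_scan`, the diagonal parameterization, and the toy parameters are all assumptions chosen for illustration.

```python
def ssm_scan(a, b, c, inputs):
    """Run a diagonal linear state space model over a scalar input sequence.

    h_t = a * h_{t-1} + b * x_t   (elementwise, one entry per state dimension)
    y_t = sum(c * h_t)
    """
    h = [0.0] * len(a)  # hidden state starts at zero
    outputs = []
    for x in inputs:
        # Each state dimension decays by its own factor and absorbs the new input.
        h = [a_i * h_i + b_i * x for a_i, h_i, b_i in zip(a, h, b)]
        # The output is a linear readout of the hidden state.
        outputs.append(sum(c_i * h_i for c_i, h_i in zip(c, h)))
    return outputs


# Hypothetical toy parameters: one state that never decays (a = 1.0)
# and one that halves at every step (a = 0.5).
a = [1.0, 0.5]
b = [1.0, 1.0]
c = [1.0, 1.0]

# A single impulse at the first step, followed by silence.
ys = ssm_scan(a, b, c, [1.0, 0.0, 0.0, 0.0])
print(ys)
```

In this toy setting the non-decaying state keeps the first input indefinitely while the other state fades, which gives a rough intuition for why recurrences of this form can carry information over long horizons.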
Key Insights
by Mohammad Rez... at arxiv.org, 03-08-2024
https://arxiv.org/pdf/2403.04253.pdf
Deeper Questions