The paper introduces Recall to Imagine (R2I), a method that integrates state space models (SSMs) into the world models of model-based reinforcement learning (MBRL) agents. The integration aims to improve temporal coherence, long-term memory, and credit assignment. Across a range of tasks, R2I sets a new state of the art on challenging memory and credit assignment RL problems. It reaches superhuman performance in the complex Memory Maze domain while remaining comparable to existing methods on classic RL benchmarks such as Atari and DMC. R2I is also faster than DreamerV3, the state-of-the-art MBRL method, which shortens convergence time.
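To make the idea of an SSM carrying long-term memory concrete, here is a minimal sketch of a generic discrete linear state space recurrence, x_k = a * x_{k-1} + B u_k and y_k = C x_k, with a diagonal state matrix. It is not the R2I architecture: the function name `ssm_scan`, the parameter values, and the toy input are all assumptions chosen for illustration only.

```python
import numpy as np

def ssm_scan(a, B, C, inputs):
    """Run a discrete linear SSM with a diagonal state matrix over a sequence.

    x_k = a * x_{k-1} + B @ u_k   (state update, a is the diagonal of A)
    y_k = C @ x_k                 (readout)
    """
    x = np.zeros_like(a)
    outputs = []
    for u in inputs:
        x = a * x + B @ u
        outputs.append(C @ x)
    return np.stack(outputs)

# Hypothetical toy setup: two state channels, one input dimension.
a = np.array([0.99, 0.5])        # slowly and quickly decaying memory channels
B = np.array([[1.0], [1.0]])     # both channels read the same input
C = np.eye(2)                    # read each state channel out directly

# A single impulse at step 0, followed by silence for 49 steps.
inputs = np.zeros((50, 1))
inputs[0, 0] = 1.0

outputs = ssm_scan(a, B, C, inputs)
```

In this toy run, the channel with decay 0.99 still holds a sizeable trace of the impulse at the last step, while the channel with decay 0.5 has effectively forgotten it. Eigenvalues close to 1 are one simple way a linear recurrence can retain information over long horizons.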
Key Insights Distilled From
by Mohammad Rez... at arxiv.org 03-08-2024
https://arxiv.org/pdf/2403.04253.pdf