This paper introduces TransDreamer, a novel reinforcement learning agent that leverages transformers for improved long-term memory and reasoning in visual control tasks, outperforming the previous state-of-the-art, Dreamer, in complex environments requiring long-range dependencies.
QT-TDM, a novel model-based reinforcement learning algorithm, leverages the strengths of Transformer Dynamics Models (TDM) and Autoregressive Q-Learning to achieve superior performance and sample efficiency in real-time continuous control tasks, effectively addressing the limitations of slow inference speed often associated with TDMs.