Efficient Novelty Recovery in Long-Horizon Tasks using Planning and Reinforcement Learning
The agent learns a "bridge policy" via Reinforcement Learning to efficiently adapt to novel situations encountered during long-horizon tasks, by leveraging its planning model.