Optimizing online algorithms with limited coverage through hybrid reinforcement learning.
Optimizing exploration in reinforcement learning by incorporating offline data to improve coverage and efficiency.
Pointer Q-Network (PQN) combines Pointer Networks (Ptr-Nets) and Q-learning to efficiently address challenges in the Orienteering Problem (OP).