Efficient Multi-Task Reinforcement Learning for Adaptive Traffic Signal Control
MTLIGHT enhances the agent observation with a latent state learned from numerous traffic indicators, and employs multiple auxiliary and supervisory tasks to learn the latent state, which improves the convergence speed and asymptotic performance of traffic signal control.