A continual reinforcement learning approach that leverages forward transfer of knowledge between optimization policies with overlapping subsets of actions to learn the ultimate policy in a data-efficient task-oriented fashion, enabling a two-fold reduction in deployment lead-time compared to a reinitialize-and-retrain baseline.


coremsg

continual-model-based-reinforcement-learning-for-efficient-wireless-network-parameter-optimization


Continual Model-based Reinforcement Learning for Efficient Wireless Network Parameter Optimization