Delay-Optimal Data Packet Transmission in Dense mmWave Networks using Structured Reinforcement Learning
The authors propose mmDPT-TS, a structured reinforcement learning solution that efficiently solves the delay-optimal data packet transmission problem in dense mmWave networks. They formulate this problem as a restless multi-armed bandit problem with fairness constraints (RMAB-F).
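To make the RMAB-F framing concrete, the following is a minimal sketch of what a restless bandit with fairness constraints could look like for packet queues. It is not the authors' mmDPT-TS method, and every name and modeling choice here is an assumption for illustration: each arm is a user queue, `budget` caps how many arms can be served per slot, `min_fraction` is a per-arm minimum share of slots in which the arm must be served, and average backlog is used as a rough stand-in for delay. The arms are "restless" in the sense that packets keep arriving at every queue whether or not it is served.

```python
from typing import List, Sequence, Set


def schedule_is_feasible(
    schedule: Sequence[Set[int]],
    num_arms: int,
    budget: int,
    min_fraction: Sequence[float],
) -> bool:
    """Check a schedule against two assumed RMAB-F style constraints.

    schedule[t] is the set of arms activated (served) in slot t.
    Budget: at most `budget` arms are activated in any slot.
    Fairness: arm n is activated in at least min_fraction[n] of all slots.
    """
    horizon = len(schedule)
    counts = [0] * num_arms
    for active in schedule:
        if len(active) > budget:
            return False  # per-slot activation budget violated
        for arm in active:
            counts[arm] += 1
    return all(counts[n] >= min_fraction[n] * horizon for n in range(num_arms))


def average_backlog(
    schedule: Sequence[Set[int]],
    arrivals: Sequence[Sequence[int]],
) -> float:
    """Average total queue length per slot (a hypothetical delay proxy).

    arrivals[t][n] packets join queue n in slot t regardless of whether
    arm n is activated; an activated arm transmits one packet if it has any.
    """
    num_arms = len(arrivals[0])
    queues: List[int] = [0] * num_arms
    total = 0
    for active, arrived in zip(schedule, arrivals):
        for n in range(num_arms):
            queues[n] += arrived[n]
            if n in active and queues[n] > 0:
                queues[n] -= 1
        total += sum(queues)
    return total / len(schedule)


# Toy instance: two users, one transmission per slot, one arrival per user per slot.
round_robin = [{0}, {1}, {0}, {1}]
toy_arrivals = [[1, 1]] * 4
feasible = schedule_is_feasible(round_robin, num_arms=2, budget=1, min_fraction=[0.5, 0.5])
backlog = average_backlog(round_robin, toy_arrivals)
```

In this toy instance the round-robin schedule satisfies both constraints, while a schedule that serves user 0 in three of the four slots would fail the fairness check for user 1. A delay-optimal policy in this framing would look for the feasible schedule with the smallest long-run backlog.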