Managing QoS in Multi-Tenant Cloud Services with Reinforcement Learning
The authors propose a novel approach that uses deep reinforcement learning to manage tenant-specific quality-of-service (QoS) levels in multi-tenant, multi-accelerator cloud environments. The focus is on guaranteeing model-specific QoS levels while accounting for real-time constraints.