Optimal Quantum Control Policies for Markov Decision Processes
The authors introduce a mathematical formulation of quantum Markov decision processes (q-MDPs) that generalizes classical MDPs to the quantum domain. They prove a verification theorem showing that Markovian quantum control policies are sufficient, and they establish a dynamic programming principle for q-MDPs.
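
For orientation, the classical setting that q-MDPs generalize can be sketched in a few lines. The block below is a minimal illustration of finite-horizon dynamic programming (backward induction) for an ordinary finite MDP; it is not the authors' quantum construction. The function name `backward_induction`, the two-state example, and the reward and transition tables are all hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

def backward_induction(
    P: List[List[List[float]]],   # P[a][s][s2]: probability of s -> s2 under action a
    r: List[List[float]],         # r[s][a]: one-step reward in state s under action a
    horizon: int,
) -> Tuple[List[float], List[List[int]]]:
    """Classical finite-horizon dynamic programming (illustrative sketch only)."""
    n_states, n_actions = len(r), len(r[0])
    V = [0.0] * n_states          # terminal value is assumed to be zero
    policy: List[List[int]] = []  # policy[t][s]: action chosen at time t in state s
    for _ in range(horizon):
        new_V, decision = [], []
        for s in range(n_states):
            # Bellman backup: immediate reward plus expected value-to-go.
            q = [
                r[s][a] + sum(P[a][s][s2] * V[s2] for s2 in range(n_states))
                for a in range(n_actions)
            ]
            best = max(range(n_actions), key=lambda a: q[a])
            decision.append(best)
            new_V.append(q[best])
        V = new_V
        policy.insert(0, decision)  # stages are computed last-to-first
    return V, policy

# Hypothetical example: action 0 keeps the state, action 1 swaps the two states.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
r = [[0.0, 0.0], [1.0, 1.0]]   # reward 1 per step spent in state 1
values, policy = backward_induction(P, r, horizon=3)
print(values)     # optimal expected total reward from each starting state
print(policy[0])  # optimal first-stage action for each state
```

The policy returned here depends only on the current state and the time step, which is the classical counterpart of the Markovian policies discussed above. How the state, action, and transition objects are replaced in the quantum setting is specified in the paper itself and is not reproduced in this sketch.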