Near-Optimal Reinforcement Learning Algorithm for Zero-Delay Coding of Markov Sources
A reinforcement learning algorithm is presented that can efficiently compute near-optimal zero-delay coding policies for Markov sources, overcoming the computational challenges of previous approaches.