Efficient Model-Free Exploration in Low-Rank MDPs: Overcoming Challenges with VoX Algorithm
The author proposes the VoX algorithm as a sample-efficient and model-free solution for exploration in Low-Rank MDPs, addressing challenges of computational efficiency and representation learning.