Guided Exploration in Reinforcement Learning via Ensemble of Monte Carlo Critics
A novel guided exploration method using an ensemble of Monte Carlo Critics to dynamically adjust exploration during reinforcement learning, leading to superior performance compared to modern algorithms.