EfficientZero V2 introduces a general framework for sample-efficient reinforcement learning (RL) that targets both discrete and continuous control. Under limited-data settings, it outperforms the current state of the art across diverse tasks.