Accelerating Torque-Based Legged Locomotion Policies with Decaying Action Priors
A two-stage framework that leverages position-based imitation data and decaying action priors to accelerate the training of torque-based legged locomotion policies, enabling consistent convergence to high-quality gaits.