Anchor Critics, a novel method leveraging dual Q-values from both simulated and real-world data, effectively mitigates catastrophic forgetting in reinforcement learning, enabling robust sim-to-real transfer for robotic control tasks, particularly demonstrated in quadrotor flight control.
The authors investigate policy-learning approaches for sim-to-real transfer in robotic manipulation using TIAGo and Nvidia simulators, emphasizing collision-less movement in both simulation and real environments.