
Learning Smooth Humanoid Robot Locomotion Using Lipschitz-Constrained Policies for Robust Real-World Transfer


Core Concept
Lipschitz-Constrained Policies (LCP) enforce smooth action outputs through a differentiable gradient penalty. The method offers a simple and effective alternative to traditional smoothing techniques for training robust locomotion controllers for humanoid robots, and it enables successful sim-to-real transfer.
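To make the idea of a gradient penalty concrete, here is a minimal sketch in plain Python. It assumes the penalty is the batch mean of the squared norm of the gradient of the policy's log-probability with respect to the state. The linear-Gaussian policy, the function names, and the `penalty_coef` weight are hypothetical choices for illustration, not the authors' implementation. A real training setup would use a neural-network policy and let an autodiff framework compute the gradient.

```python
import math

def log_prob(state, action, weights, log_std):
    """Log-density of a diagonal Gaussian policy whose mean is linear in the state."""
    total = 0.0
    for row, a, ls in zip(weights, action, log_std):
        mean = sum(w * s for w, s in zip(row, state))
        var = math.exp(2.0 * ls)
        total += -0.5 * (a - mean) ** 2 / var - ls - 0.5 * math.log(2.0 * math.pi)
    return total

def grad_log_prob_wrt_state(state, action, weights, log_std):
    """Analytic gradient of log_prob with respect to the state (toy policy only)."""
    grad = [0.0] * len(state)
    for row, a, ls in zip(weights, action, log_std):
        mean = sum(w * s for w, s in zip(row, state))
        scale = (a - mean) / math.exp(2.0 * ls)
        for j, w in enumerate(row):
            grad[j] += scale * w
    return grad

def gradient_penalty(states, actions, weights, log_std):
    """Batch mean of the squared L2 norm of the state-gradient of the log-probability."""
    total = 0.0
    for s, a in zip(states, actions):
        g = grad_log_prob_wrt_state(s, a, weights, log_std)
        total += sum(x * x for x in g)
    return total / len(states)

def regularized_loss(policy_loss, states, actions, weights, log_std, penalty_coef):
    """Add the weighted penalty to an existing RL loss (penalty_coef is hypothetical)."""
    return policy_loss + penalty_coef * gradient_penalty(states, actions, weights, log_std)
```

Because the penalty is a differentiable function of the policy parameters, it can be added to the usual RL objective and minimized with the same optimizer. Larger values of `penalty_coef` push the policy toward outputs that change more slowly with the state.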
Summary

Chen, Z., He, X., Wang, Y.-J., Liao, Q., Ze, Y., Li, Z., Sastry, S. S., Wu, J., Sreenath, K., Gupta, S., & Peng, X. B. (2024). Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies. arXiv preprint arXiv:2410.11825.
This research paper addresses the challenge of transferring reinforcement learning (RL) based locomotion policies from simulation to real-world humanoid robots. It introduces Lipschitz-Constrained Policies (LCP), a method for enforcing smooth and robust behaviors.

Key insights distilled from

by Zixuan Chen,... at arxiv.org 10-16-2024

https://arxiv.org/pdf/2410.11825.pdf
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies

Deeper Inquiries
