The paper proposes a two-step method to jointly learn the cost function and constraints from human demonstrations.
In the first step, the demonstrations are segmented into active and inactive parts based on the assumption that unknown constraints only affect certain parts of the demonstrations. The inactive segments are then used to learn the underlying cost function that drives the system's behavior in the absence of unknown constraints.
In the second step, the learned cost function is used to identify the unknown constraints. Deviations of the trajectory from the unconstrained behavior are attributed to the unknown constraints. The method can handle both inclusive and exclusive constraints, with the latter being relaxed into a convex formulation.
The proposed approach is validated through simulations with varying numbers of demonstrations and unknown constraints, as well as a real-world robotic manipulation task. The experiments show the importance of accurately estimating the cost function for the constraint learning process, and that the joint learning of cost and constraints can closely match the performance of the known cost and constraints.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Shivam Chaub... lúc arxiv.org 05-07-2024
https://arxiv.org/pdf/2405.03491.pdfYêu cầu sâu hơn