The paper proposes a two-step method to jointly learn the cost function and constraints from human demonstrations.
In the first step, the demonstrations are segmented into active and inactive parts based on the assumption that unknown constraints only affect certain parts of the demonstrations. The inactive segments are then used to learn the underlying cost function that drives the system's behavior in the absence of unknown constraints.
In the second step, the learned cost function is used to identify the unknown constraints. Deviations of the trajectory from the unconstrained behavior are attributed to the unknown constraints. The method can handle both inclusive and exclusive constraints, with the latter being relaxed into a convex formulation.
The proposed approach is validated through simulations with varying numbers of demonstrations and unknown constraints, as well as a real-world robotic manipulation task. The experiments show the importance of accurately estimating the cost function for the constraint learning process, and that the joint learning of cost and constraints can closely match the performance of the known cost and constraints.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Shivam Chaub... alle arxiv.org 05-07-2024
https://arxiv.org/pdf/2405.03491.pdfDomande più approfondite