The paper presents GenCHiP, a system that leverages Large Language Models (LLMs) to generate robot policy code for high-precision, contact-rich manipulation tasks. The key insights are:
By exposing an action space that parameterizes compliance with constraints on interaction forces and stiffnesses, LLMs can successfully generate policies for a variety of contact-rich tasks, including peg insertion, cable routing, and connector insertion.
Compared to a baseline that uses a point-to-point action space, GenCHiP improves success rates on subtasks from the Functional Manipulation Benchmark (FMB) and NIST Task Boards by 3x and 4x respectively.
The paper demonstrates that LLMs, without any specialized training, can leverage their world knowledge about object geometry and contact forces to reason about and compose the necessary motion patterns for high-precision manipulation.
Prompting strategies that provide task descriptions, API documentation, and example code are crucial for guiding the LLM towards generating relevant and executable policy code.
The paper validates the approach on a range of contact-rich manipulation tasks, including peg insertion with different geometries, cable routing and unrouting, and waterproof connector insertion. The results show that GenCHiP significantly outperforms baselines that do not expose the compliance action space to the LLM.
На другой язык
из исходного контента
arxiv.org
Дополнительные вопросы