Khái niệm cốt lõi
A framework that combines data-driven task anticipation using large language models and knowledge-driven planning to enable efficient human-robot collaboration, with the ability to adapt to unexpected changes in human action outcomes and preferences.
Tóm tắt
The paper presents DaTAPlan, a framework that combines data-driven task anticipation using large language models (LLMs) and knowledge-driven planning for efficient human-robot collaboration.
The key components of DaTAPlan are:
- An LLM is used to predict a sequence of anticipated tasks based on a small number of prompts containing partial task sequences and scene descriptions.
- A classical planner computes a plan of fine-granularity actions for the agent and the human to collaboratively achieve the anticipated tasks.
- The agent adapts to unexpected changes in human action outcomes or preferences by replanning from the current state or generating new task predictions.
The framework was evaluated in realistic household scenarios with multiple tasks, rooms, and objects. The results demonstrate:
- LLMs can accurately anticipate future tasks based on a small number of contextual examples.
- Combining task anticipation and action planning substantially improves the efficiency of planning and execution compared to using just the classical planner.
- Human-robot collaboration results in more efficient goal attainment compared to no active collaboration.
- The agent can automatically adapt to unexpected changes in action outcomes and preferences of humans.
Thống kê
The execution cost of the plan is the sum of the costs of all actions executed by the agent and the human.
The length of a plan is the total number of actions in the plan, computed as the sum of the number of actions executed by the agent and the human.