AutoGuide introduces a framework to extract state-aware guidelines from offline data, improving LLM agents' decision-making. By leveraging implicit knowledge in offline experiences, AutoGuide provides concise natural language guidelines that enhance an agent's performance. The method outperforms competitive baselines in sequential decision-making benchmarks by providing relevant guidelines at test time based on the current state.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Yao Fu,Dong-... lúc arxiv.org 03-15-2024
https://arxiv.org/pdf/2403.08978.pdfYêu cầu sâu hơn