WebCiteS introduces attributed query-focused summarization (AQFS) for Chinese web search results. The dataset features human-annotated summaries with citations derived from real-world user queries and search results. Evaluation metrics distinguish groundedness errors and citation errors, highlighting the challenge of explicit attribution in large language models. Models struggle with accurate citations, but supervised fine-tuning improves both summarization utility and attribution quality. Long-context settings reduce model performance, especially in accurately pinpointing supporting evidence within the context.
To Another Language
from source content
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Haolin Deng,... lúc arxiv.org 03-05-2024
https://arxiv.org/pdf/2403.01774.pdfYêu cầu sâu hơn