The study introduces ROUGE-K, an evaluation metric focusing on keywords in summaries. It reveals that existing metrics may overlook essential information and proposes methods to guide models to include more keywords without compromising overall quality.
The research highlights the significance of including keywords in summaries for efficient information conveyance. Human annotators prefer summaries with more keywords as they capture important information better. The proposed ROUGE-K metric complements traditional metrics by providing a better index for evaluating summary relevance.
Experiments show that strong baseline models frequently fail to include essential words in their summaries. The study also evaluates large language models using ROUGE-K and demonstrates how it can differentiate system performance effectively. Additionally, four approaches are proposed to enhance keyword inclusion in transformer-based models.
Overall, the research emphasizes the importance of keyword inclusion in summaries and introduces a new metric, ROUGE-K, to address this aspect comprehensively.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Sotaro Takes... lúc arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.05186.pdfYêu cầu sâu hơn