The study introduces ROUGE-K, an evaluation metric focusing on keywords in summaries. It reveals that existing metrics may overlook essential information and proposes methods to guide models to include more keywords without compromising overall quality.
The research highlights the significance of including keywords in summaries for efficient information conveyance. Human annotators prefer summaries with more keywords as they capture important information better. The proposed ROUGE-K metric complements traditional metrics by providing a better index for evaluating summary relevance.
Experiments show that strong baseline models frequently fail to include essential words in their summaries. The study also evaluates large language models using ROUGE-K and demonstrates how it can differentiate system performance effectively. Additionally, four approaches are proposed to enhance keyword inclusion in transformer-based models.
Overall, the research emphasizes the importance of keyword inclusion in summaries and introduces a new metric, ROUGE-K, to address this aspect comprehensively.
Til et annet språk
fra kildeinnhold
arxiv.org
Viktige innsikter hentet fra
by Sotaro Takes... klokken arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.05186.pdfDypere Spørsmål