The study introduces ROUGE-K, an evaluation metric focusing on keywords in summaries. It reveals that existing metrics may overlook essential information and proposes methods to guide models to include more keywords without compromising overall quality.
The research highlights the significance of including keywords in summaries for efficient information conveyance. Human annotators prefer summaries with more keywords as they capture important information better. The proposed ROUGE-K metric complements traditional metrics by providing a better index for evaluating summary relevance.
Experiments show that strong baseline models frequently fail to include essential words in their summaries. The study also evaluates large language models using ROUGE-K and demonstrates how it can differentiate system performance effectively. Additionally, four approaches are proposed to enhance keyword inclusion in transformer-based models.
Overall, the research emphasizes the importance of keyword inclusion in summaries and introduces a new metric, ROUGE-K, to address this aspect comprehensively.
Para outro idioma
do conteúdo fonte
arxiv.org
Principais Insights Extraídos De
by Sotaro Takes... às arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.05186.pdfPerguntas Mais Profundas