The content discusses a novel rule-driven news captioning method for generating image descriptions by incorporating a news-aware semantic rule. The method is evaluated on two datasets, demonstrating competitive performance against existing methods. Key points include the importance of named entities, the effectiveness of embedding semantic rules in deep layers, and qualitative analysis showcasing accurate caption generation.
Existing methods focus on large-scale pre-trained models for news captioning but overlook fundamental rules of news reporting. The proposed method integrates a news-aware semantic rule into BART to generate captions adhering to these rules. Experimental results show competitive performance on GoodNews and NYTimes800k datasets.
Key metrics such as BLEU-4, METEOR, ROUGE, CIDEr, and precision/recall scores for named entities are used to evaluate the method's performance. Ablation studies confirm the effectiveness of using the news-aware semantic rule and embedding named entities in the model.
Qualitative analysis demonstrates how the proposed method accurately captures key events in images to generate informative captions following designated rules. Future work may involve multi-modal knowledge integration for improved performance.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Ning Xu,Ting... lúc arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.05101.pdfYêu cầu sâu hơn