The content introduces the SiCF score approach for semi-supervised dialogue abstractive summarization. It addresses label noise in pseudolabels by measuring semantic invariance, coverage, and faithfulness. The SiCF score is shown to be effective in enhancing uncertainty estimation and improving dialogue summarization.
The study prioritizes abstractive summarization over extractive approaches due to its flexibility. Challenges like scarcity of annotations are addressed through pre-trained models and unlabeled dialogues. The proposed SiCF score framework evaluates summary quality without relying on ground truth summaries.
Previous research focused on data augmentation for semi-supervised dialogue summarization but overlooked pseudolabel noise. The study aims to enhance performance by measuring pseudolabel quality and eliminating unreliable pseudolabels.
Various methods have been proposed for label noise in natural language understanding tasks, but they may not directly apply to SSDS due to diverse ground truth summaries. The study introduces a new approach to address pseudolabel noise in SSDS effectively.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Jianfeng He,... lúc arxiv.org 03-08-2024
https://arxiv.org/pdf/2403.04073.pdfYêu cầu sâu hơn