The content discusses the task of opinion summarization in the e-commerce domain, where the potential of additional sources such as product description and question-answers has been less explored. The authors propose a novel synthetic dataset creation (SDC) strategy that leverages information from reviews as well as additional sources to select one of the reviews as a pseudo-summary, enabling supervised training.
The authors introduce a Multi-Encoder Decoder framework for Opinion Summarization (MEDOS) that employs a separate encoder for each source, allowing effective selection of information while generating the summary. Due to the unavailability of test sets with additional sources, the authors extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries.
Experiments across nine test sets demonstrate that the combination of the SDC approach and MEDOS model achieves on average a 14.5% improvement in ROUGE-1 F1 over the state-of-the-art. Comparative analysis underlines the significance of incorporating additional sources for generating more informative summaries. Human evaluations further indicate that MEDOS scores relatively higher in coherence and fluency compared to existing models.
Ke Bahasa Lain
dari konten sumber
arxiv.org
Wawasan Utama Disaring Dari
by Tejpalsingh ... pada arxiv.org 04-09-2024
https://arxiv.org/pdf/2404.05243.pdfPertanyaan yang Lebih Dalam