The content discusses the task of opinion summarization in the e-commerce domain, where the potential of additional sources such as product description and question-answers has been less explored. The authors propose a novel synthetic dataset creation (SDC) strategy that leverages information from reviews as well as additional sources to select one of the reviews as a pseudo-summary, enabling supervised training.
The authors introduce a Multi-Encoder Decoder framework for Opinion Summarization (MEDOS) that employs a separate encoder for each source, allowing effective selection of information while generating the summary. Due to the unavailability of test sets with additional sources, the authors extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries.
Experiments across nine test sets demonstrate that the combination of the SDC approach and MEDOS model achieves on average a 14.5% improvement in ROUGE-1 F1 over the state-of-the-art. Comparative analysis underlines the significance of incorporating additional sources for generating more informative summaries. Human evaluations further indicate that MEDOS scores relatively higher in coherence and fluency compared to existing models.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Tejpalsingh ... at arxiv.org 04-09-2024
https://arxiv.org/pdf/2404.05243.pdfDeeper Inquiries