MMAPS introduces innovative tasks and modeling techniques to enhance multi-modal learning for product summarization. Extensive experiments demonstrate its superiority over state-of-the-art methods, showcasing improved performance across various metrics.
Given the long textual product information and the product image, Multi-modal Product Summarization (MPS) aims to increase customers’ desire to purchase by highlighting product characteristics with a short textual summary. Existing MPS methods lack end-to-end product summarization, multi-grained multi-modal modeling, and multi-modal attribute modeling. To address these issues, MMAPS is proposed as an innovative solution for generating high-quality product summaries in e-commerce. MMAPS jointly models product attributes and generates product summaries by designing several multi-grained multi-modal tasks to guide the learning process effectively. By incorporating both text and image modalities for modeling product attributes, MMAPS outperforms existing methods in terms of summarization metrics on a real large-scale Chinese e-commerce dataset.
Para outro idioma
do conteúdo fonte
arxiv.org
Principais Insights Extraídos De
by Tao Chen,Ze ... às arxiv.org 03-11-2024
https://arxiv.org/pdf/2308.11351.pdfPerguntas Mais Profundas