The paper proposes a novel framework called DIMO (Disentangling ID and MOdality effects) to disentangle the effects of item ID and modality in session-based recommendation.
At the item level, DIMO introduces a co-occurrence representation schema to explicitly incorporate co-occurrence patterns into ID embeddings, and aligns different modalities (text, images) into a unified semantic space.
At the session level, DIMO presents a multi-view self-supervised disentanglement approach, including a proxy mechanism and counterfactual inference, to distinguish ID and modality effects without supervised signals.
Leveraging the disentangled causes, DIMO provides recommendations via causal inference and generates two types of explanations: co-occurrence template and feature template.
Extensive experiments on multiple real-world datasets demonstrate DIMO's consistent superiority over existing state-of-the-art methods in both accuracy and interpretability.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Xiaokun Zhan... lúc arxiv.org 04-22-2024
https://arxiv.org/pdf/2404.12969.pdfYêu cầu sâu hơn