The paper proposes a novel framework called DIMO (Disentangling ID and MOdality effects) to disentangle the effects of item ID and modality in session-based recommendation.
At the item level, DIMO introduces a co-occurrence representation schema to explicitly incorporate co-occurrence patterns into ID embeddings, and aligns different modalities (text, images) into a unified semantic space.
At the session level, DIMO presents a multi-view self-supervised disentanglement approach, including a proxy mechanism and counterfactual inference, to distinguish ID and modality effects without supervised signals.
Leveraging the disentangled causes, DIMO provides recommendations via causal inference and generates two types of explanations: co-occurrence template and feature template.
Extensive experiments on multiple real-world datasets demonstrate DIMO's consistent superiority over existing state-of-the-art methods in both accuracy and interpretability.
Іншою мовою
із вихідного контенту
arxiv.org
Ключові висновки, отримані з
by Xiaokun Zhan... о arxiv.org 04-22-2024
https://arxiv.org/pdf/2404.12969.pdfГлибші Запити