The paper proposes a novel framework called DIMO (Disentangling ID and MOdality effects) to disentangle the effects of item ID and modality in session-based recommendation.
At the item level, DIMO introduces a co-occurrence representation schema to explicitly incorporate co-occurrence patterns into ID embeddings, and aligns different modalities (text, images) into a unified semantic space.
At the session level, DIMO presents a multi-view self-supervised disentanglement approach, including a proxy mechanism and counterfactual inference, to distinguish ID and modality effects without supervised signals.
Leveraging the disentangled causes, DIMO provides recommendations via causal inference and generates two types of explanations: co-occurrence template and feature template.
Extensive experiments on multiple real-world datasets demonstrate DIMO's consistent superiority over existing state-of-the-art methods in both accuracy and interpretability.
Başka Bir Dile
kaynak içeriğinden
arxiv.org
Önemli Bilgiler Şuradan Elde Edildi
by Xiaokun Zhan... : arxiv.org 04-22-2024
https://arxiv.org/pdf/2404.12969.pdfDaha Derin Sorular