This paper presents a novel XAI framework that relaxes the constraint of requiring the causal graph to be known, which is a limitation of previous methods like LEWIS. The proposed framework leverages counterfactual probabilities and additional prior information on causal structure to integrate a causal graph estimated through causal discovery methods and a black-box classification model.
The key highlights of the study are:
Analysis of the effects of causal structures on explanation scores and proposal of useful prior information on the causal structure to determine the Nesuf score.
Numerical experiments using artificial data to demonstrate the possibility of estimating the global explanatory score and the order of the true feature importance even if the causal graph is not fully known.
Application of the proposed method to real-world credit rating data from Shiga Bank, Japan, showing the effectiveness of the approach when the causal graph is unknown.
The experiments showed that incorporating prior information on the causal structure, such as the target variable having a direct parent-child relationship with all explanatory variables or being the sink variable, can improve the estimation of the Nesuf score compared to the case where no causal graph is assumed. The results indicate that the proposed framework can provide useful explanations even when the causal graph is unknown, by leveraging causal discovery methods.
翻译成其他语言
从原文生成
arxiv.org
更深入的查询