Core Concepts
The core message of this article is to propose a new method for explaining neural network image classification by decomposing the input image into a class-agnostic part and a class-distinct part. This provides a radically different way of explaining classification compared to standard heatmap-based approaches.
Abstract
The article presents a new method called Decomposition-based Explainable AI (DXAI) for explaining neural network image classification. The key idea is to decompose the input image into two additive parts: a class-agnostic part that does not contain class-specific information, and a class-distinct part that holds the discriminative features responsible for the classification.
The authors argue that standard heatmap-based XAI methods are less informative in scenarios where the classification relies on dense, global, and additive features, such as color or texture. In contrast, the DXAI decomposition can better explain such cases.
The authors formulate the DXAI problem as an optimization to find the closest class-agnostic image to the input. They propose an approximate solution using style transfer GANs, where the class-distinct part is isolated in the first generator branch, while the subsequent branches generate the class-agnostic components.
The training process encourages the generators to isolate the class-distinct features, using an α-blending mechanism and various loss functions. The authors show qualitative and quantitative results on several datasets, demonstrating the advantages of DXAI over heatmap-based explanations, especially for classification tasks relying on additive and global features.
The authors also discuss limitations of their approach, such as the lack of a natural pixel-wise importance ranking, and suggest potential improvements using diffusion-based generative models.
Stats
The article does not provide specific numerical data or metrics, but rather focuses on qualitative comparisons and examples.
Quotes
"We propose a new way to explain and to visualize neural network classification through a decomposition-based explainable AI (DXAI). Instead of providing an explanation heatmap, our method yields a decomposition of the image into class-agnostic and class-distinct parts, with respect to the data and chosen classifier."
"The class-agnostic part ideally is composed of all image features which do not posses class information, where the class-distinct part is its complementary."
"Our approach assumes a membership logic, such that each region is potentially a superposition of image features common to many classes and ones which are class-specific."