Kernkonzepte
The core message of this work is to efficiently tackle the ambiguity-induced false activation issue in both the class activation map generation and pseudo label refinement stages of weakly supervised semantic segmentation through uncertainty inference and affinity diversification.
Zusammenfassung
The authors propose a unified single-staged framework called UniA to address the ambiguity problem in weakly supervised semantic segmentation (WSSS). The key insights are:
- Uncertainty Inference:
- The authors model the feature extraction as a Gaussian distribution to capture the complex dependencies among ambiguous regions and avoid the bias towards unrelated objects.
- An uncertainty estimation is introduced to the class activation map (CAM) generation stage to suppress the false activation caused by ambiguity.
- A distribution loss is designed to supervise the probabilistic process and maintain the uncertainty level.
- A soft ambiguity masking strategy is proposed to incorporate the uncertainty into feature learning.
- Affinity Diversification:
- The authors observe that the prevailing affinity-based refinement techniques suffer from the ambiguity and tend to be over-smooth, leading to semantic errors.
- A mutual complementing refinement is proposed to rectify the ambiguous affinity while saving the most certain semantics from multiple inferred pseudo labels.
- A contrastive affinity loss is further designed to stably propagate the diversity among ambiguous regions into the feature representations.
Extensive experiments on PASCAL VOC, MS COCO, and medical ACDC datasets demonstrate the superiority of UniA in tackling ambiguity and generating high-quality semantic predictions, especially for objects with fuzzy boundaries.
Statistiken
The proposed UniA framework achieves 74.1% mIoU on the PASCAL VOC 2012 validation set, outperforming recent state-of-the-art single-staged and even multi-staged methods.
On the MS COCO 2014 validation set, UniA achieves 43.2% mIoU, significantly better than recent single-staged competitors.
On the medical ACDC 2017 dataset, UniA achieves 83.75% Dice Similarity Coefficient and 0.21 confusion ratio, outperforming other WSSS methods designed for both natural and medical scenarios.
Zitate
"The ambiguity-induced false activation issue in both stages of CAMs and refinement is formally reported in WSSS."
"We introduce the uncertainty estimation to this stage for the first time and make the feature representation robust against noisy regions."
"We design an affinity diversification module to make the category semantics distinctive from the unrelated regions."