Core Concepts
A novel Continual Unsupervised Domain Adaptation (CONDA) approach that allows a semantic segmentation model to continuously learn and adapt to new unlabeled target domains while maintaining performance on previous domains, without requiring access to the original training data.
Abstract
The paper presents a novel Continual Unsupervised Domain Adaptation (CONDA) approach for semantic scene segmentation in self-driving cars. The key insights are:
Formulation of the continual domain adaptation problem in semantic segmentation by regularizing the distribution shift of predictions between source and target domains to avoid catastrophic forgetting.
Derivation of a Bijective Maximum Likelihood loss to measure the distribution shift without requiring access to the source training data during the adaptation phase.
Design of a multi-scale bijective network architecture to effectively model the distribution of source predictions.
Experiments on the benchmark of GTA5 → Cityscapes → IDD → Mapillary demonstrate that the proposed CONDA approach outperforms prior unsupervised domain adaptation methods and achieves state-of-the-art results in the continual learning setting.
Stats
The paper reports the following key metrics:
On the GTA5 → Cityscapes → IDD → Mapillary benchmark, the proposed CONDA approach achieves a higher mIoU than the previous baseline by +1.9%.
Compared to the baseline, CONDA improves the per-class IoU by 14.1% for "object", 8.7% for "sky", and 2.9% for "vehicle" on the Cityscapes dataset.
On the IDD dataset, CONDA improves the baseline mIoU by 2.6%, with per-class improvements of 18.3% for "constr", 19.0% for "object", and 1.2% for "sky".
On the Mapillary dataset, CONDA improves the baseline mIoU by 1.2%, with per-class improvements of 0.1% for "flat", 1.6% for "constr", 3.0% for "object", 0.3% for "nature", 1.0% for "sky", and 2.8% for "vehicle".
Quotes
"Our proposed approach is designed without the requirement of accessing previous training data."
"Minimizing the Bijective Maximum Likelihood loss also imposes the distance between two distributions of source predictions and target predictions."
"Experiments on the benchmark of GTA5 → Cityscapes → IDD → Mapillary have shown the advanced performance of the proposed CONDA method."