Core Concepts
This paper introduces ConDo, a novel domain adaptation framework designed to address the challenge of "confounded shift," where both covariate and label shifts occur simultaneously and are intertwined. ConDo aims to achieve general-purpose data backwards compatibility by minimizing the divergence between source and target conditional distributions, enabling the use of adapted data for various downstream tasks, including prediction and statistical analysis, even with pre-existing, non-updatable models.
Stats
The target distribution is Uniform[0, 8] and the source distribution is Uniform[4, 8].
The source distribution is a mixture of Gaussians 0.25N(5, 12) + 0.75N(0, 22), while the target is 0.75N(5, 12) + 0.25N(0, 22).
The ANSUR II dataset comprises 93 anthropometric measurements of 6068 military personnel.
A random subsample of 500 individuals with a 75%-25% (and a 25%-75%) male-female split was used for the source (and target) datasets.