Enhancing Unsupervised Domain Adaptation through Vision Transformer-based Adversarial Training
Employing the Vision Transformer (ViT) as a plug-and-play feature extractor in adversarial domain adaptation can significantly improve the transferability and discriminability of learned domain-invariant features.