The author introduces Atrous Attention, a fusion of regional and sparse attention, inspired by atrous convolution, to balance local and global information in vision transformers.
Atrous Attention in ACC-ViT enhances global context and hierarchical relations in vision transformers.
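The combination of regional and sparse attention described above can be illustrated with a small NumPy sketch. This is not the ACC-ViT implementation: it assumes a single head, identity query/key/value projections, and a single dilation rate, and the function names (`atrous_partition`, `self_attention`, `atrous_attention`) are hypothetical. It only shows the core idea borrowed from atrous convolution: tokens are sampled with a stride (the dilation rate), and attention is computed within each resulting sparse sub-grid, so each token attends to spatially distant tokens at reduced cost.

```python
import numpy as np

def atrous_partition(x, dilation):
    """Split an (H, W, C) feature map into dilation**2 sparse sub-grids.

    Sub-grid (i, j) holds the tokens at rows i, i+d, i+2d, ... and
    columns j, j+d, j+2d, ..., mirroring the sampling of atrous convolution.
    """
    h, w, _ = x.shape
    assert h % dilation == 0 and w % dilation == 0, "H and W must be divisible by dilation"
    return np.stack([x[i::dilation, j::dilation]
                     for i in range(dilation)
                     for j in range(dilation)])

def self_attention(tokens):
    """Single-head softmax attention with identity projections (illustration only)."""
    _, c = tokens.shape
    scores = tokens @ tokens.T / np.sqrt(c)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ tokens

def atrous_attention(x, dilation):
    """Apply attention independently inside each dilated sub-grid of x."""
    _, _, c = x.shape
    out = np.empty_like(x)
    for i in range(dilation):
        for j in range(dilation):
            sub = x[i::dilation, j::dilation]           # sparse, strided view
            sh, sw, _ = sub.shape
            attended = self_attention(sub.reshape(-1, c))
            out[i::dilation, j::dilation] = attended.reshape(sh, sw, c)
    return out
```

With `dilation=1` the sketch reduces to ordinary attention over all tokens; larger dilation rates yield smaller, sparser groups that still span the whole feature map. How outputs from several dilation rates are combined is left out here, since the summary above does not specify it.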