Efficient Training for Vision Transformers via Token Expansion
The proposed Token Expansion (ToE) method achieves consistent training acceleration for Vision Transformers by maintaining the integrity of the intermediate feature distribution through an "initialization-expansion-merging" pipeline.