Horizontally Scalable Vision Transformer: Preserving Inductive Bias for Efficient Image Classification
A novel horizontally scalable vision transformer (HSViT) architecture that preserves the inductive bias from convolutional layers while reducing the number of layers and parameters, enabling efficient image classification on resource-constrained devices.