Core Concepts
Harmformer, a novel vision transformer architecture, leverages harmonic networks to achieve continuous roto-translation equivariance, outperforming previous equivariant transformers and competing with convolution-based models.
Karella, T., Harmanec, A., Kotera, J., Blažek, J., & Šroubek, F. (2024). Harmformer: Harmonic Networks Meet Transformers for Continuous Roto-Translation Equivariance. arXiv preprint arXiv:2411.03794.
This paper introduces Harmformer, a novel vision transformer architecture designed to achieve continuous roto-translation equivariance by integrating harmonic networks into the transformer architecture. The authors aim to demonstrate Harmformer's superior performance compared to existing equivariant transformers and its ability to compete with convolution-based equivariant networks.