Vision transformers with shifted window attention improve voxel 3D reconstruction accuracy.
Efficiently reconstruct 3D assets from single images using Gamba, leveraging Gaussian splatting and Mamba for speed and quality.
Vision transformers with shifted window attention improve voxel 3D reconstruction accuracy.