The author introduces METER, a novel lightweight vision transformer architecture for monocular depth estimation on embedded devices, achieving state-of-the-art results by balancing computational complexity and hardware constraints.
Monocular depth estimation using a novel lightweight vision transformer architecture, METER, achieves state-of-the-art results on embedded devices.