A Spike Transformer Network for Accurate Depth Estimation from Event Cameras via Cross-Modality Knowledge Distillation
This paper presents a spike transformer network that uses cross-modality knowledge distillation from a large vision foundation model to estimate depth accurately from event camera data.
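
To make the idea of cross-modality knowledge distillation concrete, the following is a minimal sketch of one common form of distillation objective, not the paper's actual loss. It assumes the student (the spike network fed with events) and the teacher (the foundation model fed with a paired image) both output dense depth maps of the same shape. The function name `distillation_loss`, the weighting parameter `alpha`, and the choice of an MSE distillation term with an L1 supervised term are all illustrative assumptions.

```python
import numpy as np


def distillation_loss(student_depth, teacher_depth, gt_depth=None,
                      valid_mask=None, alpha=0.5):
    """Hypothetical distillation objective (an assumption, not the paper's loss).

    student_depth: depth predicted by the student from event data.
    teacher_depth: depth predicted by the frozen teacher from the paired image.
    gt_depth:      optional ground-truth depth; if None, only distillation is used.
    valid_mask:    optional boolean mask of pixels with valid ground truth.
    alpha:         weight of the supervised term (assumed hyperparameter).
    """
    student_depth = np.asarray(student_depth, dtype=np.float64)
    teacher_depth = np.asarray(teacher_depth, dtype=np.float64)

    # Distillation term: mean squared error between student and teacher maps.
    distill = np.mean((student_depth - teacher_depth) ** 2)
    if gt_depth is None:
        return float(distill)

    gt_depth = np.asarray(gt_depth, dtype=np.float64)
    if valid_mask is None:
        valid_mask = np.ones(gt_depth.shape, dtype=bool)

    # Supervised term: mean absolute error over pixels with valid ground truth.
    supervised = np.mean(np.abs(student_depth - gt_depth)[valid_mask])

    # Weighted sum of the supervised and distillation terms.
    return float(alpha * supervised + (1.0 - alpha) * distill)


if __name__ == "__main__":
    student = np.zeros((2, 2))
    teacher = np.ones((2, 2))
    gt = 2.0 * np.ones((2, 2))
    print(distillation_loss(student, teacher))      # distillation term only
    print(distillation_loss(student, teacher, gt))  # combined objective
```

In this sketch the teacher is treated as fixed, so only the student's predictions would receive gradients in a real training loop. Distilling intermediate features instead of output depth maps is an equally plausible design.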