Camera-LiDAR Fusion Transformer for Robust Semantic Segmentation in Autonomous Driving
A novel vision transformer-based network, CLFT, that employs an innovative progressive-assemble strategy to effectively fuse camera and LiDAR data for robust semantic segmentation in autonomous driving.