SLCF-Net is a pioneering method that combines RGB images and sparse LiDAR scans to infer a 3D voxelized semantic scene. The model leverages Gaussian-decay Depth-prior Projection for feature projection and inter-frame feature propagation for temporal consistency. By integrating historical information, SLCF-Net excels in both accuracy and consistency metrics on the SemanticKITTI dataset. The model's performance surpasses other SSC baselines, showcasing its effectiveness in semantic scene completion tasks.
לשפה אחרת
מתוכן המקור
arxiv.org
תובנות מפתח מזוקקות מ:
by Helin Cao,Sv... ב- arxiv.org 03-15-2024
https://arxiv.org/pdf/2403.08885.pdfשאלות מעמיקות