SLCF-Net is a pioneering method that combines RGB images and sparse LiDAR scans to infer a 3D voxelized semantic scene. The model leverages Gaussian-decay Depth-prior Projection for feature projection and inter-frame feature propagation for temporal consistency. By integrating historical information, SLCF-Net excels in both accuracy and consistency metrics on the SemanticKITTI dataset. The model's performance surpasses other SSC baselines, showcasing its effectiveness in semantic scene completion tasks.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Helin Cao,Sv... alle arxiv.org 03-15-2024
https://arxiv.org/pdf/2403.08885.pdfDomande più approfondite