SLCF-Net is a pioneering method that combines RGB images and sparse LiDAR scans to infer a 3D voxelized semantic scene. The model leverages Gaussian-decay Depth-prior Projection for feature projection and inter-frame feature propagation for temporal consistency. By integrating historical information, SLCF-Net excels in both accuracy and consistency metrics on the SemanticKITTI dataset. The model's performance surpasses other SSC baselines, showcasing its effectiveness in semantic scene completion tasks.
A otro idioma
del contenido fuente
arxiv.org
Ideas clave extraídas de
by Helin Cao,Sv... a las arxiv.org 03-15-2024
https://arxiv.org/pdf/2403.08885.pdfConsultas más profundas