A novel diffusion-based generative model, DifFUSER, is proposed to enhance multi-modal fusion for improved 3D object detection and BEV map segmentation performance, leveraging the denoising property of diffusion models.
効率的かつ効果的な3D知覚モデルの開発が重要である。