3D Open-Vocabulary Panoptic Segmentation with Vision-Language Distillation
This paper presents the first approach to 3D open-vocabulary panoptic segmentation for autonomous driving. It leverages large vision-language models and proposes novel loss functions for effective learning.
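
To make "vision-language distillation" concrete, here is a minimal sketch of one common way such a setup can work. It is not the paper's actual loss functions, which this summary does not spell out. The sketch assumes a 3D model (the student) predicts a feature vector per point or segment, a frozen vision-language model (the teacher) supplies target features in the same embedding space, and class names are embedded as text vectors. All function names and the toy vectors are hypothetical.

```python
import math


def _normalize(vec, eps=1e-8):
    """Scale a vector to (approximately) unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / (norm + eps) for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    return sum(x * y for x, y in zip(_normalize(a), _normalize(b)))


def distillation_loss(student_feats, teacher_feats):
    """Mean cosine distance between paired student and teacher features.

    Assumption: a generic feature-distillation objective, used here only
    for illustration; the paper's own losses may differ.
    """
    distances = [
        1.0 - cosine_similarity(s, t)
        for s, t in zip(student_feats, teacher_feats)
    ]
    return sum(distances) / len(distances)


def classify_open_vocab(feature, text_embeddings):
    """Return the class name whose text embedding is closest to `feature`.

    `text_embeddings` maps a class name to its (hypothetical) text vector,
    so new categories can be added without retraining the student.
    """
    return max(
        text_embeddings,
        key=lambda name: cosine_similarity(feature, text_embeddings[name]),
    )


# Toy usage with made-up 2D embeddings.
vocab = {"car": [1.0, 0.0], "pedestrian": [0.0, 1.0]}
label = classify_open_vocab([0.9, 0.1], vocab)
```

Because classification reduces to a nearest-text-embedding lookup, the label set can be extended at inference time by adding entries to `vocab`, which is the property that makes the segmentation "open-vocabulary" in this sketch.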