Enhancing Diverse 3D Object Detection in Autonomous Driving through Language-Driven Active Learning
A language-driven active learning framework, VisLED, that leverages vision-language embeddings to efficiently query diverse and informative data samples, enhancing the model's ability to detect underrepresented or novel objects in autonomous driving scenarios.