Leveraging Unlabeled Image-Pointcloud Pairs to Improve 3D Object Classification without Labels
By leveraging unlabeled image-pointcloud pairs, the proposed Cross-Modal Self-Training framework significantly improves zero-shot 3D object classification without requiring any class-level annotations.