Geometrically-Driven Aggregation for Enhancing Zero-Shot 3D Point Cloud Understanding
Geometrically-driven aggregation of vision-language model representations can effectively improve the quality of zero-shot 3D point cloud understanding across various downstream tasks.