Gaga: Consistent 3D Segmentation from Inconsistent 2D Masks
Gaga reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models, eliminating label inconsistency across views through a 3D-aware memory bank.