Enhancing 3D vision understanding through a unified VisionGPT-3D framework.
Large language models and state-of-the-art vision models are integrated in the VisionGPT-3D framework to enhance 3D vision understanding.