Keskeiset käsitteet
Existing 3D-VL models struggle with natural language variations, highlighting the need for improved language robustness.
Tilastot
"The chair is black with wheels. It is to the right of the desk." - Original sentence used for evaluation.
"Even the state-of-the-art 3D-LLM fails to understand some variants of the same sentences." - Mention of model failure.
Lainaukset
"The fusion module is biased towards the training dataset, rather than genuinely understanding the semantics of natural language."
"Existing 3D-VL models exhibit sensitivity to styles of language input, struggling with sentences written in different variants."