R2-Bench evaluates the robustness of referring perception models against perturbations like environmental noise, human-induced errors, and sensor limitations. The benchmark assesses performance across tasks like image segmentation, video object segmentation, audiovisual segmentation, and 3D mapping. It introduces R2-Agent, an LLM-based assistant for model evaluation automation.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Xiang Li,Kai... at arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.04924.pdfDeeper Inquiries