R2-Bench evaluates the robustness of referring perception models against perturbations like environmental noise, human-induced errors, and sensor limitations. The benchmark assesses performance across tasks like image segmentation, video object segmentation, audiovisual segmentation, and 3D mapping. It introduces R2-Agent, an LLM-based assistant for model evaluation automation.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Xiang Li,Kai... alle arxiv.org 03-11-2024
https://arxiv.org/pdf/2403.04924.pdfDomande più approfondite