The study introduces CLEAR, a unified network for person attribute recognition and retrieval tasks. It leverages cross-transformers and pre-trained language models to address both tasks efficiently. The study demonstrates the effectiveness of the CLEAR model on five benchmarks, achieving competitive results and outperforming other models. The model's architecture, training strategy, and evaluation results are detailed, showcasing its superior performance.
إلى لغة أخرى
من محتوى المصدر
arxiv.org
الرؤى الأساسية المستخلصة من
by Doanh C. Bui... في arxiv.org 03-12-2024
https://arxiv.org/pdf/2403.06119.pdfاستفسارات أعمق