CLEAR: Cross-Transformers with Pre-trained Language Model for Person Attribute Recognition and Retrieval
A unified CLEAR model utilizing cross-transformers and pre-trained language models achieves state-of-the-art performance in person attribute recognition and retrieval tasks.