The article introduces the CSQA-Net, focusing on improving fine-grained visual categorization by addressing the quality of extracted visual representations. The network includes modules like MPMSCA and MLSQE to capture discriminative features and evaluate semantic quality progressively. Experiments show superior performance on popular FGVC datasets.
A otro idioma
del contenido fuente
arxiv.org
Ideas clave extraídas de
by Qin Xu,Siton... a las arxiv.org 03-18-2024
https://arxiv.org/pdf/2403.10298.pdfConsultas más profundas