Core Concepts
PEEB is an explainable and editable image classifier that outperforms CLIP-based classifiers in both zero-shot and supervised learning settings.
Abstract
PEEB introduces a novel approach to image classification by utilizing text descriptors for visual parts, providing transparency in decision-making. The model surpasses existing methods in fine-grained classification tasks, showcasing superior performance and adaptability. PEEB's reliance on accurate descriptors highlights its robustness and versatility across various datasets.
Stats
CLIP-based classifiers rely heavily on class names in the prompt, impacting accuracy significantly when replaced with uncommon alternatives.
PEEB outperforms CLIP-based classifiers by +8 to +29 points in bird classification across different datasets.
Compared to concept bottleneck models, PEEB excels in both zero-shot and supervised learning settings.
Quotes
"CLIP-based classifiers depend mostly on class names in the prompt."
"PEEB outperforms the baselines across all three datasets."
"PEEB exhibits superior GZSL performance compared to recent text concept-based approaches."