المفاهيم الأساسية
Dinomaly, a novel framework for multi-class unsupervised anomaly detection, achieves state-of-the-art performance by leveraging a simplified Transformer architecture with four key elements: foundation Transformers, a noisy bottleneck, linear attention, and loose reconstruction.
الإحصائيات
Dinomaly achieves image-level AUROC of 99.6%, 98.7%, and 89.3% on MVTec-AD, VisA, and Real-IAD datasets, respectively.
On MVTec-AD, Dinomaly outperforms previous state-of-the-art methods by 1.0% in AUROC, 0.2% in AP, and 1.2% in F1-max for image-level detection.
For pixel-level detection on MVTec-AD, Dinomaly surpasses previous best results by 0.7% in AUROC, 9.1% in AP, 7.7% in F1-max, and 1.6% in AUPRO.
Dinomaly exhibits scalability, with larger ViT architectures leading to improved performance.
Increasing input image size further enhances Dinomaly's performance, contrary to previous methods that experience degradation.
اقتباسات
"What I cannot create, I do not understand" — Richard Feynman
"Dropout is all you need."
"One man’s poison is another man’s meat"
"The tighter you squeeze, the less you have."