Kernkonzepte
Approx-FIRAL is a scalable active learning algorithm that dramatically accelerates the original FIRAL algorithm without compromising accuracy, enabling efficient active learning for large-scale datasets.
Zusammenfassung
The content presents Approx-FIRAL, a scalable active learning algorithm for multiclass classification using logistic regression. The original FIRAL algorithm was shown to outperform state-of-the-art methods, but suffered from high computational and storage complexity.
To address these challenges, the authors propose several key innovations in Approx-FIRAL:
- Exploiting the structure of the Hessian matrix to enable fast matrix-vector multiplications and an effective preconditioner for the conjugate gradient solver in the RELAX step.
- Introducing a modified ROUND step that only requires block-diagonal matrix operations, significantly reducing the computational and storage requirements.
- Developing a parallel GPU-accelerated implementation that achieves strong and weak scaling on up to 12 GPUs.
The accuracy tests demonstrate that Approx-FIRAL matches the performance of the original FIRAL algorithm, while being orders of magnitude faster. The authors showcase the scalability of Approx-FIRAL on large-scale datasets like ImageNet, which were intractable for the original FIRAL.
Statistiken
The authors report the following key metrics:
Storage complexity of Approx-FIRAL: O(n(d + c) + cd^2)
Computational complexity of Approx-FIRAL RELAX step: O(nrelax ncd(d + nCGs))
Computational complexity of Approx-FIRAL ROUND step: O(bncd^2)
Zitate
"Approx-FIRAL demonstrates approximately 29 times faster performance than Exact-FIRAL for the ImageNet-50 dataset, and about 177 times faster for the Caltech-101 dataset."
"Approx-FIRAL outperforms other methods like Random, K-means, and Entropy in the active learning test results, and maintains a consistent performance level across both balanced and imbalanced datasets."