The paper aims to address the limitations of existing self-supervised learning (SSL) methods in achieving true generality. It first provides a theoretical definition of generality in SSL, which involves discriminability, transferability, and generalization. Based on this, the paper proposes a σ-measurement to quantify the generality of SSL models.
To explicitly model generality into SSL, the authors propose a novel SSL framework called GeSSL. GeSSL learns general representations through a bi-level optimization process:
The second-level optimization is guided by a self-motivated target based on the proposed σ-measurement, which encourages the model to update towards the optimal direction for generality.
The paper provides theoretical analysis on the rationality of the task construction and the performance guarantee of GeSSL. Extensive experiments on various benchmarks, including unsupervised learning, semi-supervised learning, transfer learning, and few-shot learning, demonstrate the superior robustness and generalization of GeSSL compared to state-of-the-art SSL methods.
На другой язык
из исходного контента
arxiv.org
Дополнительные вопросы