The paper introduces Natural Learning (NL), a novel machine learning algorithm that aims to achieve a high level of explainability and interpretability. NL is inspired by prototype theory from cognitive psychology, which suggests that people categorize objects based on their similarity to prototypical examples.
The key aspects of NL are:
NL discovers a single prototype sample for each class, along with the minimal set of features that characterize the prototype. This aligns with the principles of prototype theory, which states that people rely on sparse prototypes for categorization.
NL employs locality-sensitive hashing (LSH) to efficiently find the nearest neighbors of each sample, addressing the curse of dimensionality. It also uses a recursive feature pruning method to identify the core features of the prototypes.
The training process of NL is optimization-free and involves simple operations like nearest neighbor search and element-wise vector comparisons. This results in NL models that are extremely sparse, both in terms of the number of prototypes and the number of features used.
The prediction process of NL is also straightforward: it assigns a new sample to the class whose prototype it is closest to, measured only on the selected prototype features.
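The prediction step described above can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `predict_nearest_prototype`, the toy prototypes, the selected feature indices, and the use of Euclidean distance are all assumptions made for illustration, and the prototypes are hand-picked here rather than learned by NL.

```python
import math

def predict_nearest_prototype(x, prototypes, feature_idx):
    """Return the label whose prototype is closest to x on the selected features.

    Euclidean distance is an assumption; the summary does not name the metric.
    """
    def dist(proto):
        # Compare the sample and the prototype only on the selected features.
        return math.dist([x[i] for i in feature_idx],
                         [proto[i] for i in feature_idx])
    return min(prototypes, key=lambda label: dist(prototypes[label]))

# Hypothetical toy model: one prototype per class, five features,
# of which only features 0 and 2 are kept.
prototypes = {
    "healthy": [0.1, 5.0, 0.2, 9.0, 1.0],
    "disease": [0.9, 5.0, 0.8, 1.0, 1.0],
}
feature_idx = [0, 2]

sample = [0.8, 0.0, 0.7, 9.0, 3.0]
print(predict_nearest_prototype(sample, prototypes, feature_idx))  # disease
```

In this toy example the same sample would be assigned to "healthy" if all five features were compared, which shows why the choice of prototype features matters as much as the choice of prototype.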
The paper presents an extensive empirical evaluation of NL on 17 benchmark datasets, including high-dimensional gene expression data and low-dimensional healthcare datasets. The results show that NL performs comparably to state-of-the-art black-box models such as deep neural networks and random forests in 40% of cases, with an average accuracy only 1-2% lower. Importantly, NL models are highly interpretable, using only a few prototypes and features, and exhibit the lowest model variance among all classifiers.
The paper concludes by discussing the potential applications of NL in domains that prioritize interpretability and explainability, such as healthcare, finance, and criminal justice, where the existence of prototypical cases is common.
Key insights extracted from
by Hadi Fanaee-... at arxiv.org, 04-10-2024
https://arxiv.org/pdf/2404.05903.pdf