Belangrijkste concepten
One-class classifiers can be trained to implement the likelihood test, a statistically optimal decision rule for null hypothesis testing when the alternative hypothesis is unknown.
Samenvatting
The content discusses the problem of deciding between two alternative probability density functions (pdfs) P0 and P1 based on an observed sample, when only P0 is known. This scenario arises in security contexts where the attacker's behavior is unknown.
The authors propose training machine learning models, specifically neural networks (NNs) and least-squares support vector machines (LS-SVMs), to operate as the likelihood test (LT). The key ideas are:
Generate an artificial dataset of samples uniformly distributed over the domain of the positive class (P0) and train the NN or LS-SVM as a two-class classifier using this dataset. The trained model then implements the LT.
Derive a modified stochastic gradient descent (SGD) algorithm for training the NN that does not require the artificial dataset, but still converges to the LT.
Prove that the one-class LS-SVM with suitable kernels also converges to the LT as the training dataset size increases.
Show that the widely used autoencoder classifier generally does not provide the LT.
The authors demonstrate the equivalence of the proposed one-class classifiers to the LT through numerical results on Gaussian, Gaussian mixture, and finite input space datasets.
Statistieken
The content does not provide any specific numerical data or statistics. It focuses on the theoretical analysis and derivation of the proposed techniques.