Core Concepts
Understanding the impact of multivariate symmetrical uncertainty on feature selection.
Abstract
This content delves into the analysis of multivariate symmetrical uncertainty for feature selection. It explores the behavior of the measure through statistical simulation techniques, highlighting the effects of attributes, cardinalities, and sample size on the measure. The content also proposes a heuristic condition to preserve quality in the measure under different combinations of factors, providing a valuable criterion for dimensionality reduction.
Structure:
Introduction to Sample Representativeness
Theoretical Fundamentals
Analysis of Bias and Proposal
Results
Conclusions
Stats
The MSU is proposed as a generalization of the SU based on total correlation.
The MSU restricts its values to the range between 0 and 1.
The MSU can be applied to discrete and categorical variables.
Quotes
"In this thesis, through observation of results, it is proposed an heuristic condition that preserves good quality in the MSU under different combinations of these three factors, providing a new useful criterion to help drive the process of dimension reduction."