Основні поняття
Shap-select is a new feature selection framework that improves the performance of machine learning models by combining SHAP values with statistical significance testing during the model training process.
Статистика
The dataset contains 284,807 transactions, of which 492 are labeled as fraudulent.
The data was split into train 0.60, validation 0.20 and test 0.20 sets.
shap-select selected 6 features with a runtime of 21 seconds.
HISEL selected all 30 features with a runtime of 109 seconds.
RFE selected 15 features with a runtime of 12.9 seconds.
Boruta selected 11 features with a runtime of 95.8 seconds.