Core Concepts
A novel mutual information and ensemble-based feature ranking approach can effectively identify clinically relevant features to accurately classify different stages of clear cell renal cell carcinoma.
Abstract
The article presents a detailed experimentation to identify important features that can aid in diagnosing clear cell renal cell carcinoma (ccRCC) at different stages. The authors propose a novel mutual information and ensemble-based feature ranking approach that considers the order of features obtained from 8 popular feature selection methods.
The key highlights are:
- The ccRCC dataset is obtained from The Cancer Genome Atlas (TCGA) and contains a combination of clinical, histopathological, and demographic features.
- 8 feature selection methods, including 2 wrapper, 5 filter, and 1 embedded techniques, are used to initially rank the features.
- A positional table is formed from the ranking information obtained by these 8 methods, and the final ranking of features is determined by considering the importance of features across all the methods.
- The performance of the proposed feature ranking method is evaluated using 2 different classifiers (ANN and SVM).
- Experimental results show that the proposed method can achieve a higher accuracy (96.6% and 98.6% using SVM and NN, respectively) for classifying different stages of ccRCC with a reduced feature set compared to existing work.
- The proposed method was able to select 2 out of the 3 distinguishing features mentioned by the existing TNM system (size of tumor and metastasis status) as the top-most ones, establishing the efficacy of the approach.
Stats
The size of the tumor (feature 7) is an important indicator for classifying different stages of clear cell renal cell carcinoma.
The metastasis status (feature 9) is another key feature for distinguishing between the cancer stages.
The life span in days after diagnosis (feature 22) and the tumor grade G score (feature 0) also play significant roles in accurate stage classification.
Quotes
"Early and accurate detection of ccRCC is necessary to prevent further spreading of the disease in other organs."
"Out of 3 distinguishing features as mentioned by the existing TNM system (proposed by AJCC and UICC), our proposed method was able to select two of them (size of tumour, metastasis status) as the top-most ones."