The study aimed to identify and analyze the cyber-based security threats faced by Bangladeshi netizens. A questionnaire was created and data was collected from both victims and non-victims of cyber attacks. The dataset was expanded using data augmentation and feature importance was analyzed using the Chi-squared test. Backward elimination was used to select the most important 20 features.
Several machine learning classification algorithms were trained and evaluated on the dataset. The Random Forest (RF) classifier trained with 20 features achieved the highest accuracy of 95.95%. The RF classifier also had the best combination of accuracy and AUC values compared to other models.
To identify key risk factors, the selected 20 features were decomposed into 38 factors. Association rule mining using the Apriori algorithm was performed to extract rules with high confidence (above 80%) that indicate the relationship between the factors and the victim class. The top 10 association rules provide insights into the key risk factors leading to cyber attacks.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Fatama Tuz J... at arxiv.org 04-02-2024
https://arxiv.org/pdf/2404.00068.pdfDeeper Inquiries