Concepts de base
A hybrid deep learning model combining Artificial Neural Network (ANN) and Long Short-Term Memory (LSTM) achieves high accuracy in detecting phishing websites.
Résumé
The paper presents a study on detecting phishing websites using machine learning and deep learning approaches. The dataset consists of 10,000 instances, with 5,000 phishing websites and 5,000 legitimate websites, and 48 features.
The authors evaluate the performance of five machine learning models (decision tree, k-nearest neighbor, naive Bayes, logistic regression, and SVM) and four deep learning models (ANN, LSTM, and a proposed hybrid model ANN-LSTM). The key findings are:
- The proposed hybrid model ANN-LSTM achieves the highest accuracy of 98%, outperforming the other models.
- Logistic regression and ANN also perform well, but not as well as the combined ANN-LSTM model.
- The k-nearest neighbor (KNN) classifier has the lowest accuracy of 74% due to the high computational cost of using a large number of neighbors (k=100).
- The authors also compare the proposed model's performance with existing models, and the ANN-LSTM model shows the best accuracy.
The paper demonstrates the effectiveness of using a hybrid deep learning approach for the task of phishing website detection, providing a robust solution to this cybersecurity challenge.
Stats
The dataset consists of 10,000 instances, with 5,000 phishing websites and 5,000 legitimate websites.
The dataset has 48 features, including URL-based, content-based, and other characteristics of the websites.
Citations
"Our proposed hybrid empirical method performs better than other models with 98 percent accuracy and k-Nearest Neighbor performs poorly with an accuracy of 74 percent because the lowest number of k=100 using the large numbers of k is computationally expensive to get the result."