This paper presents a detailed review of the research progress in AI-based webshell detection methods. It categorizes the relevant studies into three stages: Start Stage, Initial Development Stage, and In-depth Development Stage, based on the timeline and technological development.
In the Start Stage, researchers focused on the preliminary exploration of AI-related algorithms for webshell detection, using simple convolutional neural networks, long short-term memory, and other techniques. The Initial Development Stage saw a surge of research, with researchers optimizing the entire detection pipeline, including data preprocessing, feature extraction, and classification. Methods in this stage employed a variety of techniques, such as abstract feature extraction, ensemble learning, and hybrid models.
The In-depth Development Stage, starting from late 2021, has witnessed the application of more advanced AI models, such as BERT-based language models and graph neural networks, to webshell detection tasks. Researchers have also explored new methodological paradigms, including few-shot learning, federated learning, and continual learning, to address the challenges in webshell detection.
The paper also discusses the critical issues and challenges faced by the existing methods, such as appropriate data representation, the trade-off between machine learning and deep learning, data imbalance, and dataset limitations. Finally, it predicts the future development trends in this field, including the use of few-shot learning, federated learning, continual learning, large language models, and novel methodological paradigms.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Mingrui Ma,L... às arxiv.org 05-02-2024
https://arxiv.org/pdf/2405.00066.pdfPerguntas Mais Profundas