Temel Kavramlar
Leveraging the capabilities of Large Language Models (LLMs) to enhance the accuracy and efficiency of security patch localization (SPL) recommendation methods.
Özet
The paper introduces LLM-SPL, an innovative approach that integrates LLM-based features into a joint learning framework to improve SPL recommendations. The key highlights are:
-
Challenges in SPL:
- The complex and intricate content of CVEs and commits requires specialized knowledge for accurate comprehension.
- Vulnerabilities often require multiple distinct patches over time, a scenario not well addressed by existing SPL methods.
- Identifying the relationships among commits is crucial but highly challenging.
-
LLM Potential:
- LLMs exhibit exceptional capabilities in processing natural language, interpreting code, and possessing extensive domain knowledge in security and software.
- Experiments show LLMs can effectively comprehend CVEs and commits, as well as recognize relationships between them.
- However, directly applying LLMs to SPL is impractical due to the high false positive rate.
-
LLM-SPL Approach:
- Incorporates two LLM-based features into a joint learning framework:
- LLM's prediction of the relationship between CVEs and commits
- LLM-endorsed inter-commit relationship graph
- Utilizes an LLM-feedback technique to refine the recommendation model, significantly reducing computational costs.
-
Evaluation Results:
- LLM-SPL outperforms the state-of-the-art SPL method, VCMatch, in all metrics - Recall, NDCG, and Manual Effort.
- For vulnerabilities requiring multiple patches, LLM-SPL improves Recall by 22.83%, NDCG by 19.41%, and reduces manual effort by over 25% when checking up to the top 10 rankings.
İstatistikler
96% of 1,700 commercial codebases examined across 17 industries incorporate open source components.
21.04% of CVEs require multiple patches for complete resolution.
LLM-SPL reduces the estimated cost from 620,000 USD to 880 USD and processing time from a century to 3 days.
Alıntılar
"LLM-SPL effectively ranked the patches for 92.74% CVEs within the top 10 positions, simultaneously delivering high-quality rankings as evidenced by the NDCG metric, which reached a high value of 87.33%."
"For CVEs requiring multiple collaborated patches, LLM-SPL significantly improved Recall from 60.30% to 83.13% (a 22.83% increase), enhanced NDCG from 60.99% to 80.40% (a 19.41% increase), and reduced manual effort by over 25% when checking up to the top 10 rankings."