This research paper introduces Lingma SWE-GPT, a series of open-source large language models (LLMs) specifically designed for automated software improvement. The authors argue that existing LLMs, while proficient in code generation, lack a deep understanding of the dynamic and iterative nature of real-world software development processes. This limitation stems from their training on static code data, which fails to capture the reasoning, tool utilization, and interactive problem-solving inherent in software engineering.
The paper addresses two key challenges in the field: the over-reliance on closed-source models, which limits accessibility and raises data privacy concerns, and the lack of comprehensive development process data in LLM training. To overcome these challenges, the authors propose a novel development process-centric training approach that simulates the software improvement process through three stages: repository understanding, fault localization, and patch generation.
In the repository understanding stage, Lingma SWE-GPT analyzes the repository structure, navigates the codebase, and identifies relevant files, classes, and functions. The fault localization stage pinpoints potential problem areas within the codebase using specialized search APIs and context analysis. Finally, the patch generation stage generates and applies patches, incorporating git-related operations and lint tools for validation and debugging.
The authors demonstrate the effectiveness of their approach through extensive evaluations on SWE-bench Verified and SWE-bench Lite, two challenging benchmarks comprising real-world GitHub issues. The results show that Lingma SWE-GPT 72B achieves a 30.20% success rate on SWE-bench Verified, surpassing existing open-source models and approaching the performance of leading closed-source alternatives. Notably, the smaller Lingma SWE-GPT 7B model also exhibits promising results, highlighting its potential for resource-constrained scenarios.
The paper concludes by emphasizing the significance of Lingma SWE-GPT as a viable open-source alternative for automated software improvement, offering comparable performance to closed-source models while addressing concerns about accessibility, customization, and data privacy. The authors suggest that future research should focus on further enhancing the model's capabilities in handling complex software systems and exploring its potential in other software engineering tasks.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Yingwei Ma, ... at arxiv.org 11-04-2024
https://arxiv.org/pdf/2411.00622.pdfDeeper Inquiries