Brancas, R., Manquinho, V., & Martins, R. (2024). Combining Logic with Large Language Models for Automatic Debugging and Repair of ASP Programs. In Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence.
This paper introduces FormHe, a tool designed to address the challenges novice programmers face when debugging ASP code. The research aims to demonstrate the effectiveness of combining logic-based debugging techniques with LLMs for automated fault localization and program repair in ASP.
FormHe employs a multi-pronged approach to fault localization, leveraging Minimal Strongly Inconsistent Correction Subsets (MSICSs), line matching algorithms, and fine-tuned LLMs as classifiers. The repair module then utilizes either a fine-tuned LLM or a program mutation enumerator to generate and verify potential fixes. The system was evaluated using both real student submissions from a university-level Automated Reasoning course and synthetically generated buggy programs.
FormHe demonstrated high accuracy in fault localization, correctly identifying all faults in 85% of real student submissions and at least one fault in 94% of cases. The combined repair approach, utilizing both LLM and mutation-based techniques, successfully repaired 58% of incorrect submissions. Notably, the LLM-based repair significantly outperformed the mutation-based approach, highlighting the potential of LLMs in this domain.
The research concludes that integrating logic-based techniques with LLMs offers a promising avenue for automated debugging and repair of ASP programs. FormHe provides valuable assistance to novice programmers by pinpointing errors and suggesting corrections, ultimately facilitating the learning process.
This work contributes significantly to the field of automated program repair, particularly for declarative programming languages like ASP. FormHe's success in assisting novice programmers has the potential to improve educational outcomes and lower the barrier to entry for ASP and other declarative languages.
While FormHe shows promise, the researchers acknowledge the limitations of relying on synthetic data for training LLMs. Future work could explore methods to improve performance on real-world code and investigate the generalization of FormHe to other declarative programming paradigms.
Til et annet språk
fra kildeinnhold
arxiv.org
Viktige innsikter hentet fra
by Ricardo Bran... klokken arxiv.org 10-29-2024
https://arxiv.org/pdf/2410.20962.pdfDypere Spørsmål