Comprehensive Survey on Language Models for Code Processing: Unifying NLP and Software Engineering Perspectives
This work provides a comprehensive survey of recent advancements in code processing with language models, covering a wide range of models, tasks, datasets, and related works. It highlights the historical transition from statistical and RNN-based models to pretrained Transformers and large language models, drawing parallels to the progress in natural language processing. The survey also discusses the integration of code-specific features and techniques adapted from NLP, as well as the latest applications of language models in software development.