Evaluating the Effectiveness of Large Language Models for Generating Documentation for Legacy Code in MUMPS and Assembly Language
While large language models (LLMs) show promise for generating useful documentation for legacy code in languages like MUMPS and Assembly Language, current automated metrics struggle to accurately assess the quality of this documentation, highlighting the need for better evaluation methods.