Improving Code Editing with Natural Language Feedback: COFFEE-GYM, a Comprehensive Environment for Training and Evaluating Feedback Models
COFFEE-GYM is a comprehensive reinforcement learning environment that addresses the challenges of training open-source feedback models for code editing. It provides a high-quality dataset (COFFEE) and a reliable reward function (COFFEEEVAL).