Fine-Grained Evaluation Capability in Language Models: Prometheus
The author argues that PROMETHEUS, an open-source LLM, can match GPT-4's evaluation capabilities when provided with reference materials. The approach involves training on diverse score rubrics to induce fine-grained evaluation.