LLM-RadJudge: Achieving Radiologist-Level Evaluation for Radiology Report Generation
Large language models can achieve radiologist-level performance in evaluating the clinical accuracy and relevance of generated radiology reports, providing an efficient and accessible alternative to manual assessment.