Evalverse: A Unified and Expandable Library for Comprehensive Evaluation of Large Language Models
Evalverse is a novel library that streamlines the evaluation of Large Language Models (LLMs) by unifying disparate evaluation tools into a single, user-friendly framework, enabling both researchers and practitioners to comprehensively assess LLM performance.