ARES is an automated framework for evaluating retrieval-augmented generation (RAG) systems along the dimensions of context relevance, answer faithfulness, and answer relevance.
FRAMES, a novel evaluation dataset, comprehensively tests the factuality, retrieval, and reasoning capabilities of retrieval-augmented generation (RAG) systems in a unified framework.