Factcheck-Bench: A Comprehensive Benchmark for Evaluating Automatic Fact-Checking Systems on Large Language Model Outputs
Factcheck-Bench is a fine-grained annotation framework and benchmark for evaluating automatic fact-checking systems on the outputs of large language models (LLMs). Its annotations cover each stage of the fact-checking process: labeling factual claims, retrieving evidence and detecting its stance, correcting false claims, and revising the response.
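As a rough illustration of how these annotation layers could fit together, the following is a minimal Python sketch of one annotated response. It is not the benchmark's actual schema: the class names (`Evidence`, `Claim`, `AnnotatedResponse`), the field names, the three stance labels, and the string-replacement revision step are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical stance labels; the benchmark's real label set may differ.
STANCES = ("supports", "refutes", "irrelevant")


@dataclass
class Evidence:
    """A retrieved passage and its stance toward one claim."""
    text: str
    stance: str

    def __post_init__(self) -> None:
        if self.stance not in STANCES:
            raise ValueError(f"unknown stance: {self.stance}")


@dataclass
class Claim:
    """A factual claim extracted from an LLM response."""
    text: str
    evidence: List[Evidence] = field(default_factory=list)
    correction: Optional[str] = None  # set only when the claim is judged false

    def is_refuted(self) -> bool:
        # Simplifying assumption: a single refuting passage marks the claim false.
        return any(e.stance == "refutes" for e in self.evidence)


@dataclass
class AnnotatedResponse:
    """An LLM response together with its claim-level annotations."""
    response: str
    claims: List[Claim] = field(default_factory=list)

    def revise(self) -> str:
        # Naive revision: substitute each refuted claim with its correction.
        revised = self.response
        for claim in self.claims:
            if claim.is_refuted() and claim.correction:
                revised = revised.replace(claim.text, claim.correction)
        return revised
```

The nesting mirrors the stages named above: a response holds claims, each claim holds evidence with a stance, and corrections at the claim level feed the revision of the full response. A real system would likely need a more careful revision step than literal string replacement.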