Pairwise Comparison Approach Improves Open-Domain Dialogue Evaluation
A novel dialogue evaluation metric, PAIREVAL, assesses responses by comparing their quality against a limited number of comparison responses, outperforming previous evaluation metrics.