Reference-free metrics show higher correlation with human judgment and sensitivity to language quality deficiencies compared to reference-based metrics.