Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems: The Importance of Dialogue Context
The availability of dialogue context significantly influences the quality and consistency of crowdsourced evaluation labels for task-oriented dialogue systems.