Multimodal Dataset for Esports Game Situation Understanding and Commentary Generation
This paper introduces Game-MUG, a new multimodal dataset that combines game event logs, caster speech transcripts, audience chats, and game audio to support both the understanding of esports game situations and the generation of engaging commentary.
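
To make the idea of combining the four modalities concrete, the following is a minimal sketch of how one time window of a match might be represented. The schema is purely illustrative: the names `TimedItem`, `Window`, `build_window`, and `audio_path`, and the choice to align streams by timestamp, are assumptions and do not describe the actual Game-MUG file format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TimedItem:
    """One timestamped entry from a text-like stream (hypothetical schema)."""
    t: float   # seconds from the start of the match
    text: str


@dataclass
class Window:
    """All modalities that fall inside one [start, end) time window."""
    start: float
    end: float
    events: List[TimedItem] = field(default_factory=list)      # game event log
    transcript: List[TimedItem] = field(default_factory=list)  # caster speech
    chats: List[TimedItem] = field(default_factory=list)       # audience chat
    audio_path: str = ""  # placeholder reference to an audio clip


def in_window(items: List[TimedItem], start: float, end: float) -> List[TimedItem]:
    """Keep the items whose timestamp lies in [start, end)."""
    return [x for x in items if start <= x.t < end]


def build_window(start, end, events, transcript, chats, audio_path=""):
    """Group the three text streams and an audio reference into one window."""
    return Window(
        start,
        end,
        in_window(events, start, end),
        in_window(transcript, start, end),
        in_window(chats, start, end),
        audio_path,
    )
```

Under these assumptions, a commentary model would read one `Window` at a time, seeing what happened in the game alongside what the caster said and how the audience reacted during the same interval.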