Khái niệm cốt lõi
UniMEEC proposes a unified framework to jointly model emotion recognition and emotion-cause pair extraction, leveraging the complementarity and causality between emotion and emotion cause.
Tóm tắt
The paper presents UniMEEC, a Unified Multimodal Emotion recognition and Emotion-Cause analysis framework, to explore the feasibility and effectiveness of jointly modeling emotion and its underlying emotion causes.
Key highlights:
- UniMEEC reformulates the Multimodal Emotion Recognition in Conversation (MERC) and Multimodal Emotion-Cause Pair Extraction (MECPE) tasks as two mask prediction problems, enhancing the interaction between emotion and cause.
- UniMEEC employs modality-specific prompt learning (MPL) to probe modality-specific knowledge from pre-trained language models and share prompt learning among modalities.
- UniMEEC introduces a task-specific hierarchical context aggregation (THC) module to capture the contexts oriented to specific tasks.
- Experiments on benchmark datasets IEMOCAP, MELD, ConvECPE, and ECF demonstrate that UniMEEC consistently outperforms state-of-the-art methods on both MERC and MECPE tasks.
- The results verify the effectiveness of the unified framework in addressing emotion recognition and emotion-cause pair extraction.
Thống kê
IEMOCAP dataset contains 7,532 samples, each labeled with six emotions.
MELD dataset contains 13,707 video clips of multi-party conversations, with labels following Ekman's six universal emotions.
ConvECPE dataset is constructed based on IEMOCAP, containing 7,433 utterances with emotion-cause pair annotations.
ECF dataset is constructed based on MELD, containing 13,509 utterances with emotion-cause pair annotations.
Trích dẫn
"Emotions are the expression of affect or feelings; responses to specific events, thoughts, or situations are known as emotion causes. Both are like two sides of a coin, collectively describing human behaviors and intents."
"Separately training MERC and MECPE can result in potential challenges in integrating the two tasks seamlessly in real-world application scenarios."