Proposing a novel audio-visual method for compound expression recognition based on emotion probability fusion and rule-based decision-making.
Proposing a zero-shot approach for recognizing compound expressions using a visual language model integrated with CNN networks.