VLLMs Leverage Common Sense Reasoning to Enhance Emotion Understanding in Context
This work proposes a novel two-stage approach that leverages the common-sense reasoning capabilities of Vision-and-Large-Language Models (VLLMs) to improve emotion classification in visual context, without introducing complex training pipelines.