The authors propose a Unified Multi-modal Image Aesthetic Assessment (UNIAA) framework, including a Multi-modal Large Language Model (MLLM) named UNIAA-LLaVA and a comprehensive benchmark named UNIAA-Bench, to align with the human aesthetic process and achieve good results in multiple aesthetic subtasks.


coremsg

a-unified-multi-modal-image-aesthetic-assessment-baseline-and-benchmark


A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark