A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
The authors propose a Unified Multi-modal Image Aesthetic Assessment (UNIAA) framework, including a Multi-modal Large Language Model (MLLM) named UNIAA-LLaVA and a comprehensive benchmark named UNIAA-Bench, to align with the human aesthetic process and achieve good results in multiple aesthetic subtasks.