M2UGen: Multi-modal Music Understanding and Generation with Large Language Models
The author introduces the M2UGen model, utilizing large language models for music understanding and multi-modal music generation. The approach aims to enhance user experience in music-related artistic creation.