COLE, a system from Microsoft Research Asia and Peking University, is a hierarchical generation framework for multi-layered, editable graphic design. It targets two problems: producing high-quality designs from vague user intentions, and supporting flexible editing based on user input. The system decomposes the complex task into subtasks handled by specialized models that work together to produce a cohesive final output. These models are fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs), each tailored to particular design tasks.
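The idea of specialized stages that each contribute one layer to an editable design can be illustrated with a minimal sketch. It is not COLE's implementation: the `Layer` and `Design` types, the stage functions, and the assignment of a model family to each stage are hypothetical stand-ins, and every "model" here is a plain function returning a placeholder string.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Layer:
    name: str     # hypothetical layer label, e.g. "background" or "text"
    content: str  # placeholder for pixels or typography attributes


@dataclass
class Design:
    intention: str
    layers: List[Layer] = field(default_factory=list)


# A stage reads the design built so far and returns the layer it owns.
Stage = Callable[[Design], Layer]


def plan_stage(design: Design) -> Layer:
    # Stand-in for a fine-tuned LLM that expands the vague intention (assumption).
    return Layer("plan", f"brief for: {design.intention}")


def image_stage(design: Design) -> Layer:
    # Stand-in for a diffusion model conditioned on the plan layer (assumption).
    plan = design.layers[0].content
    return Layer("background", f"image from [{plan}]")


def text_stage(design: Design) -> Layer:
    # Stand-in for an LMM that places text over the earlier layers (assumption).
    return Layer("text", f"typography over {len(design.layers)} layers")


def generate(intention: str, stages: List[Stage]) -> Design:
    # Run the stages in order; each one sees the output of those before it.
    design = Design(intention)
    for stage in stages:
        design.layers.append(stage(design))
    return design


def edit_layer(design: Design, name: str, content: str) -> Design:
    # Layers stay separate, so one can be replaced without regenerating the rest.
    layers = [Layer(l.name, content) if l.name == name else l for l in design.layers]
    return Design(design.intention, layers)


design = generate("poster for a jazz night", [plan_stage, image_stage, text_stage])
edited = edit_layer(design, "text", "new headline")
```

Keeping each layer as a separate record is what makes the result editable in this sketch: `edit_layer` changes the text layer while the background layer is carried over unchanged.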
Key insights extracted from
by Peidong Jia,... at arxiv.org, 03-20-2024
https://arxiv.org/pdf/2311.16974.pdf