The COLE system by Microsoft Research Asia and Peking University introduces a hierarchical generation framework for multi-layered and editable graphic design. The system addresses challenges in generating high-quality designs from vague intentions, supporting flexible editing based on user input. It breaks down the complex task into specialized models working collaboratively to produce cohesive final outputs. The system comprises fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs) tailored for various design tasks.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Peidong Jia,... alle arxiv.org 03-20-2024
https://arxiv.org/pdf/2311.16974.pdfDomande più approfondite