COLE, a system from Microsoft Research Asia and Peking University, introduces a hierarchical generation framework for multi-layered, editable graphic design. It addresses the challenge of generating high-quality designs from vague user intentions while supporting flexible editing based on user input. The framework decomposes the complex task across specialized models that work collaboratively to produce a cohesive final output. These models comprise fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs), each tailored to particular design tasks.
Key ideas extracted from: Peidong Jia,..., arxiv.org, 03-20-2024
https://arxiv.org/pdf/2311.16974.pdf