The COLE system by Microsoft Research Asia and Peking University introduces a hierarchical generation framework for multi-layered and editable graphic design. The system addresses challenges in generating high-quality designs from vague intentions, supporting flexible editing based on user input. It breaks down the complex task into specialized models working collaboratively to produce cohesive final outputs. The system comprises fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs) tailored for various design tasks.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Peidong Jia,... às arxiv.org 03-20-2024
https://arxiv.org/pdf/2311.16974.pdfPerguntas Mais Profundas