核心概念
COLE system simplifies graphic design generation through hierarchical task decomposition.
要約
The COLE system by Microsoft Research Asia and Peking University introduces a hierarchical generation framework for multi-layered and editable graphic design. The system addresses challenges in generating high-quality designs from vague intentions, supporting flexible editing based on user input. It breaks down the complex task into specialized models working collaboratively to produce cohesive final outputs. The system comprises fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs) tailored for various design tasks.
- Abstract: Graphic design evolution, role in advertising, demands of high-quality designs.
- Introduction: Advancements in natural image generation, redirection towards professional image generation.
- Our Approach: COLE framework overview, Design LLM, Text-to-Background & Text-to-Object models, Typography LMM, Multi-Layered SVG Editor & Renderer.
- Experiment: Assessment on DESIGNERINTENTION benchmark, comparison with state-of-the-art systems, ablation experiments.
- Conclusion: COLE's efficiency in graphic design generation through hierarchical task decomposition.
統計
In 1843, Henry Cole introduced the world’s first commercial Christmas card [36].
Our COLE system outperforms DALL·E3 in text fidelity and message conveyance among both non-designers and designers.