The paper introduces RALF, a Retrieval-Augmented Layout Transformer, to improve content-aware layout generation. By retrieving nearest neighbor layouts based on input images, RALF generates high-quality layouts with less training data. The study evaluates RALF's performance in unconstrained and constrained tasks, showcasing its superiority over baselines in generating diverse yet plausible layouts that harmonize with given backgrounds.
RALF successfully addresses the challenges of limited training data in content-aware layout generation by incorporating retrieval augmentation. The model outperforms state-of-the-art approaches and demonstrates robust generalizability even in out-of-domain settings. Additionally, RALF excels in various constrained generation tasks, showcasing its effectiveness in generating layouts under user-specified constraints.
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Daichi Horit... klo arxiv.org 03-05-2024
https://arxiv.org/pdf/2311.13602.pdfSyvällisempiä Kysymyksiä