The key highlights and insights from the content are:
ControlCity is a multimodal diffusion model that generates high-resolution building footprint data by integrating image, text, and metadata inputs from OpenStreetMap and other sources.
The proposed method achieves state-of-the-art performance, reducing FID error by 71.01% and increasing MIoU by 38.46% compared to existing approaches across 22 global cities.
ControlCity demonstrates strong generalization capabilities, enabling effective urban morphology transfer and zero-shot city generation across different regions.
The innovative integration of image, text, and metadata inputs allows for the generation of refined building footprints, addressing the quality asymmetry in VGI-based urban data.
The model is highly applicable to urban planning tasks, including morphology analysis and spatial data completeness assessment, providing precise insights into complex urban structures.
Vers une autre langue
à partir du contenu source
arxiv.org
Questions plus approfondies