The key highlights and insights from the content are:
ControlCity is a multimodal diffusion model that generates high-resolution building footprint data by integrating image, text, and metadata inputs from OpenStreetMap and other sources.
The proposed method achieves state-of-the-art performance, reducing FID error by 71.01% and increasing MIoU by 38.46% compared to existing approaches across 22 global cities.
ControlCity demonstrates strong generalization capabilities, enabling effective urban morphology transfer and zero-shot city generation across different regions.
The innovative integration of image, text, and metadata inputs allows for the generation of refined building footprints, addressing the quality asymmetry in VGI-based urban data.
The model is highly applicable to urban planning tasks, including morphology analysis and spatial data completeness assessment, providing precise insights into complex urban structures.
A otro idioma
del contenido fuente
arxiv.org
Ideas clave extraídas de
by Fangshuo Zho... a las arxiv.org 09-26-2024
https://arxiv.org/pdf/2409.17049.pdfConsultas más profundas