Core Concepts
The author proposes RSBuilding, a foundation model designed to enhance cross-scene generalization and task universality in remote sensing image understanding. The approach integrates building extraction and change detection tasks within a unified framework.
Abstract
RSBuilding introduces a comprehensive model for interpreting buildings from remote sensing images, addressing the challenges of shared knowledge, complex scenes, and diverse resolutions. The model demonstrates robust performance in handling both building extraction and change detection tasks simultaneously.
Buildings are crucial components of geographic information databases, with intelligent interpretation playing a significant role in urban planning and management. Current methodologies often treat building extraction and change detection as separate tasks, limiting their generalizability across different scenes and scales.
The proposed RSBuilding model leverages a foundation approach to enhance cross-scene generalization capabilities and task universality. By integrating building extraction and change detection within a single framework, the model exhibits robust zero-shot generalization capabilities across various application scenarios.
RSBuilding was trained on a dataset comprising up to 245,000 images and validated on multiple datasets for building extraction and change detection. Experimental results confirm the model's ability to handle structurally distinct tasks concurrently with strong generalization capabilities.
Stats
Buildings constitute not only a significant proportion of man-made structures but also serve as a crucial component of geographic information databases.
The proposed RSBuilding model was trained on a dataset comprising up to 245,000 images.
Experimental results substantiate that RSBuilding can concurrently handle two structurally distinct tasks with robust zero-shot generalization capabilities.