核心概念
A multi-level supervised building reconstruction network (MLS-BRN) that can flexibly utilize training samples with different annotation levels to achieve better reconstruction results in an end-to-end manner.
摘要
The paper proposes a multi-level supervised building reconstruction network (MLS-BRN) that can effectively utilize training samples with different annotation levels, including building footprint only, footprint and building height, and footprint, offset, and building height, to achieve better 3D building reconstruction performance.
Key highlights:
- Designed two new modules, Pseudo Building Bbox Calculator (PBC) and Roof-Offset guided Footprint Extractor (ROFE), to alleviate the demand for full 3D supervision.
- Introduced new tasks and training strategies for different types of samples to leverage the large-scale 2D footprint annotations and varying levels of 3D annotations.
- Conducted experiments on several public and new datasets, demonstrating that the proposed MLS-BRN achieves competitive performance using much fewer 3D-annotated samples and significantly improves the footprint extraction and 3D reconstruction compared to state-of-the-art methods.
统计
Building height can be estimated from the image-wise off-nadir angle, offset angle, and the instance-wise building height using the equation: ||⃗vb||2 = hb × sI × tan(θI) × [cos(φI), sin(φI)].
The off-nadir angle prediction task has a mean absolute error (MAE) of 1.22 degrees when trained on BN100.
The offset angle prediction task has an MAE of 9.92 degrees when trained on BN100.
引用
"To alleviate the demand on 3D annotations and enhance the building reconstruction performance, we design new tasks regarding the meta information of off-nadir images and two new modules, i.e., Pseudo Building Bbox Calculator and Roof-Offset guided Footprint Extractor, as well as a new training strategy based on different types of samples."
"Experimental results on several public and new datasets demonstrate that our method achieves competitive performance when only using a small proportion of 3D-annotated samples, and significantly improves the building segmentation and height estimation performance compared with current state-of-the-art."