HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View
Building Extraction and Height Estimation
- URL: http://arxiv.org/abs/2308.05387v1
- Date: Thu, 10 Aug 2023 07:03:32 GMT
- Title: HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View
Building Extraction and Height Estimation
- Authors: Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang
Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang
- Abstract summary: We propose a Height-hierarchy Guided Dual-decoder Network (HGDNet) to estimate building height.
Under the guidance of synthesized discrete height-hierarchy nDSM, auxiliary height-hierarchical building extraction branch enhance the height estimation branch.
Additional two-stage cascade architecture is adopted to achieve more accurate building extraction.
- Score: 13.09940764764909
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unifying the correlative single-view satellite image building extraction and
height estimation tasks indicates a promising way to share representations and
acquire generalist model for large-scale urban 3D reconstruction. However, the
common spatial misalignment between building footprints and
stereo-reconstructed nDSM height labels incurs degraded performance on both
tasks. To address this issue, we propose a Height-hierarchy Guided Dual-decoder
Network (HGDNet) to estimate building height. Under the guidance of synthesized
discrete height-hierarchy nDSM, auxiliary height-hierarchical building
extraction branch enhance the height estimation branch with implicit
constraints, yielding an accuracy improvement of more than 6% on the DFC 2023
track2 dataset. Additional two-stage cascade architecture is adopted to achieve
more accurate building extraction. Experiments on the DFC 2023 Track 2 dataset
shows the superiority of the proposed method in building height estimation
({\delta}1:0.8012), instance extraction (AP50:0.7730), and the final average
score 0.7871 ranks in the first place in test phase.
Related papers
- RoIPoly: Vectorized Building Outline Extraction Using Vertex and Logit Embeddings [5.093758132026397]
We propose a novel query-based approach for extracting building outlines from aerial or satellite imagery.
We formulate each polygon as a query and constrain the query attention on the most relevant regions of a potential building.
We evaluate our method on the vectorized building outline extraction dataset CrowdAI and the 2D floorplan reconstruction dataset Structured3D.
arXiv Detail & Related papers (2024-07-20T16:12:51Z) - SEED: A Simple and Effective 3D DETR in Point Clouds [72.74016394325675]
We argue that the main challenges are challenging due to the high sparsity and uneven distribution of point clouds.
We propose a simple and effective 3D DETR method (SEED) for detecting 3D objects from point clouds.
arXiv Detail & Related papers (2024-07-15T14:21:07Z) - PointHR: Exploring High-Resolution Architectures for 3D Point Cloud
Segmentation [77.44144260601182]
We explore high-resolution architectures for 3D point cloud segmentation.
We propose a unified pipeline named PointHR, which includes a knn-based sequence operator for feature extraction and a differential resampling operator.
To evaluate these architectures for dense point cloud analysis, we conduct thorough experiments using S3DIS and ScanNetV2 datasets.
arXiv Detail & Related papers (2023-10-11T09:29:17Z) - Data Fusion for Multi-Task Learning of Building Extraction and Height
Estimation [0.0]
This paper attempts a multitask-learning method of building extraction and height estimation using both optical and radar satellite imagery.
Contrary to the initial goal of multitask learning, this paper reports the individual implementation of the building extraction and height estimation under constraints.
arXiv Detail & Related papers (2023-08-05T22:16:19Z) - Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy
Dichotomous Image Segmentation [48.995367430746086]
High-accuracy Dichotomous Image rendering (DIS) aims to pinpoint category-agnostic foreground objects from natural scenes.
We introduce a novel Unite-Divide-Unite Network (UDUN) that restructures and bipartitely arranges complementary features to boost the effectiveness of trunk and structure identification.
Using 1024*1024 input, our model enables real-time inference at 65.3 fps with ResNet-18.
arXiv Detail & Related papers (2023-07-26T09:04:35Z) - Semi-supervised Learning from Street-View Images and OpenStreetMap for
Automatic Building Height Estimation [59.6553058160943]
We propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OpenStreetMap data.
The proposed method leads to a clear performance boosting in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters.
The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data.
arXiv Detail & Related papers (2023-07-05T18:16:30Z) - A CNN regression model to estimate buildings height maps using
Sentinel-1 SAR and Sentinel-2 MSI time series [0.0]
In this study, we propose a supervised Multimodal Building Height Network (MBHR-Net) for estimating building heights at 10m spatial resolution using Sentinel-1 (S1) and Sentinel-2 (S2) time series.
Our MBHR-Net aims to extract meaningful features from the S1 and S2 images to learn complex-temporal relationships between image patterns and building heights.
The model is trained and tested in 10 cities in the Netherlands Root Mean Squared Error (RMSE), Intersection over Union (IoU), and R-squared (R2) score metrics are used to evaluate the performance of the model.
arXiv Detail & Related papers (2023-07-03T22:16:17Z) - LIGHT: Joint Individual Building Extraction and Height Estimation from
Satellite Images through a Unified Multitask Learning Network [8.09909901104654]
Building extraction and height estimation are two important basic tasks in remote sensing image interpretation.
Most of the existing research regards the two tasks as independent studies.
In this work, we combine the individuaL buIlding extraction and heiGHt estimation through a unified multiTask learning network.
arXiv Detail & Related papers (2023-04-03T15:48:24Z) - 3D Cascade RCNN: High Quality Object Detection in Point Clouds [122.42455210196262]
We present 3D Cascade RCNN, which allocates multiple detectors based on the voxelized point clouds in a cascade paradigm.
We validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques.
arXiv Detail & Related papers (2022-11-15T15:58:36Z) - Learning Local Displacements for Point Cloud Completion [93.54286830844134]
We propose a novel approach aimed at object and semantic scene completion from a partial scan represented as a 3D point cloud.
Our architecture relies on three novel layers that are used successively within an encoder-decoder structure.
We evaluate both architectures on object and indoor scene completion tasks, achieving state-of-the-art performance.
arXiv Detail & Related papers (2022-03-30T18:31:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.