Large-scale Building Height Retrieval from Single SAR Imagery based on
Bounding Box Regression Networks
- URL: http://arxiv.org/abs/2111.09460v1
- Date: Thu, 18 Nov 2021 00:39:48 GMT
- Title: Large-scale Building Height Retrieval from Single SAR Imagery based on
Bounding Box Regression Networks
- Authors: Yao Sun, Lichao Mou, Yuanyuan Wang, Sina Montazeri, Xiao Xiang Zhu
- Abstract summary: Building height retrieval from synthetic aperture radar (SAR) imagery is of great importance for urban applications.
This paper addresses the issue of building height retrieval in large-scale urban areas from a single TerraSAR-X spotlight or stripmap image.
- Score: 21.788338971571736
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Building height retrieval from synthetic aperture radar (SAR) imagery is of
great importance for urban applications, yet highly challenging owing to the
complexity of SAR data. This paper addresses the issue of building height
retrieval in large-scale urban areas from a single TerraSAR-X spotlight or
stripmap image. Based on the radar viewing geometry, we propose that this
problem can be formulated as a bounding box regression problem and therefore
allows for integrating height data from multiple data sources in generating
ground truth on a larger scale. We introduce building footprints from
geographic information system (GIS) data as complementary information and
propose a bounding box regression network that exploits the location
relationship between a building's footprint and its bounding box, allowing for
fast computation. This is important for large-scale applications. The method is
validated on four urban data sets using TerraSAR-X images in both
high-resolution spotlight and stripmap modes. Experimental results show that
the proposed network can reduce the computation cost significantly while
keeping the height accuracy of individual buildings compared to a Faster R-CNN
based method. Moreover, we investigate the impact of inaccurate GIS data on our
proposed network, and this study shows that the bounding box regression network
is robust against positioning errors in GIS data. The proposed method has great
potential to be applied to regional or even global scales.
Related papers
- CATSNet: a context-aware network for Height Estimation in a Forested Area based on Pol-TomoSAR data [4.9793121278328]
This work defines a context-aware deep learning-based solution named CATSNet.
A convolutional neural network is considered to leverage patch-based information and extract features from a neighborhood rather than focus on a single pixel.
The experimental results show striking advantages in both performance and ability by leveraging context information within Multiple Baselines (MB) TomoSAR data across different polarimetric modalities, surpassing existing techniques.
arXiv Detail & Related papers (2024-03-29T16:27:40Z) - SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.
Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z) - Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Improving NeRF with Height Data for Utilization of GIS Data [0.0]
A method based on Neural Radiance Fields (NeRF) is proposed to effectively use height data which can be obtained from GIS (Geographic Information System)
As a result, the accuracy of image rendering was improved with faster training speed.
arXiv Detail & Related papers (2023-07-15T06:49:09Z) - Semi-supervised Learning from Street-View Images and OpenStreetMap for
Automatic Building Height Estimation [59.6553058160943]
We propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OpenStreetMap data.
The proposed method leads to a clear performance boosting in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters.
The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data.
arXiv Detail & Related papers (2023-07-05T18:16:30Z) - Learning Continuous Depth Representation via Geometric Spatial
Aggregator [47.1698365486215]
We propose a novel continuous depth representation for depth map super-resolution (DSR)
The heart of this representation is our proposed Geometric Spatial Aggregator (GSA), which exploits a distance field modulated by arbitrarily upsampled target gridding.
We also present a transformer-style backbone named GeoDSR, which possesses a principled way to construct the functional mapping between local coordinates.
arXiv Detail & Related papers (2022-12-07T07:48:23Z) - Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image
Matching [102.39635336450262]
We address the problem of ground-to-satellite image geo-localization by matching a query image captured at the ground level against a large-scale database with geotagged satellite images.
Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image.
arXiv Detail & Related papers (2022-03-26T20:10:38Z) - Focusing on Shadows for Predicting Heightmaps from Single Remotely
Sensed RGB Images with Deep Learning [3.42658286826597]
We propose a task-focused Deep Learning model that takes advantage of the shadow map of a remotely sensed image to calculate its heightmap.
We validate the model with a dataset covering a large area of Manchester, UK.
arXiv Detail & Related papers (2021-04-22T05:31:13Z) - CG-Net: Conditional GIS-aware Network for Individual Building
Segmentation in VHR SAR Images [25.87229252642239]
This paper addresses the issue of individual building segmentation from a single VHR SAR image in large-scale urban areas.
We introduce building footprints from GIS data as complementary information and propose a novel conditional GIS-aware network (CG-Net)
The proposed model learns multi-level visual features and employs building footprints to normalize the features for predicting building masks in the SAR image.
arXiv Detail & Related papers (2020-11-17T01:52:22Z) - Height estimation from single aerial images using a deep ordinal
regression network [12.991266182762597]
We deal with the ambiguous and unsolved problem of height estimation from a single aerial image.
Driven by the success of deep learning, especially deep convolution neural networks (CNNs), some researches have proposed to estimate height information from a single aerial image.
In this paper, we proposed to divide height values into spacing-increasing intervals and transform the regression problem into an ordinal regression problem.
arXiv Detail & Related papers (2020-06-04T12:03:51Z) - Real-Time High-Performance Semantic Image Segmentation of Urban Street
Scenes [98.65457534223539]
We propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes.
The proposed method achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) with the inference speed of 51.0 fps and 39.3 fps.
arXiv Detail & Related papers (2020-03-11T08:45:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.