Building Footprint Generation by IntegratingConvolution Neural Network
with Feature PairwiseConditional Random Field (FPCRF)
- URL: http://arxiv.org/abs/2002.04600v1
- Date: Tue, 11 Feb 2020 18:51:19 GMT
- Title: Building Footprint Generation by IntegratingConvolution Neural Network
with Feature PairwiseConditional Random Field (FPCRF)
- Authors: Qingyu Li, Yilei Shi, Xin Huang, Xiao Xiang Zhu
- Abstract summary: Building footprint maps are vital to many remote sensing applications, such as 3D building modeling, urban planning, and disaster management.
In this work, an end-to-end building footprint generation approach that integrates convolution neural network (CNN) and graph model is proposed.
- Score: 21.698236040666675
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Building footprint maps are vital to many remote sensing applications, such
as 3D building modeling, urban planning, and disaster management. Due to the
complexity of buildings, the accurate and reliable generation of the building
footprint from remote sensing imagery is still a challenging task. In this
work, an end-to-end building footprint generation approach that integrates
convolution neural network (CNN) and graph model is proposed. CNN serves as the
feature extractor, while the graph model can take spatial correlation into
consideration. Moreover, we propose to implement the feature pairwise
conditional random field (FPCRF) as a graph model to preserve sharp boundaries
and fine-grained segmentation. Experiments are conducted on four different
datasets: (1) Planetscope satellite imagery of the cities of Munich, Paris,
Rome, and Zurich; (2) ISPRS benchmark data from the city of Potsdam, (3) Dstl
Kaggle dataset; and (4) Inria Aerial Image Labeling data of Austin, Chicago,
Kitsap County, Western Tyrol, and Vienna. It is found that the proposed
end-to-end building footprint generation framework with the FPCRF as the graph
model can further improve the accuracy of building footprint generation by
using only CNN, which is the current state-of-the-art.
Related papers
- Graph Transformer GANs with Graph Masked Modeling for Architectural
Layout Generation [153.92387500677023]
We present a novel graph Transformer generative adversarial network (GTGAN) to learn effective graph node relations.
The proposed graph Transformer encoder combines graph convolutions and self-attentions in a Transformer to model both local and global interactions.
We also propose a novel self-guided pre-training method for graph representation learning.
arXiv Detail & Related papers (2024-01-15T14:36:38Z) - TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for
Improved Building Footprint Extraction [11.845097068829551]
We propose a novel tuning Fork Network (TFNet) design for deep semantic segmentation.
The TFNet design is coupled with a novel methodology of incorporating neighborhood information at the tile boundaries during the training process.
For performance comparisons, we utilize the SpaceNet2 and WHU datasets, as well as a dataset from an area in Lahore, Pakistan that captures closely connected buildings.
arXiv Detail & Related papers (2023-11-05T10:52:16Z) - AutoEncoding Tree for City Generation and Applications [33.364915512018364]
Huge volumes of spatial data in cities pose a challenge to the generative models.
Few publicly available 3D real-world city datasets also hinder the development of methods for city generation.
We propose AETree, a tree-structured auto-encoder neural network, for city generation.
arXiv Detail & Related papers (2023-09-27T18:36:56Z) - Building Extraction from Remote Sensing Images via an Uncertainty-Aware
Network [18.365220543556113]
Building extraction plays an essential role in many applications, such as city planning and urban dynamic monitoring.
We propose a novel and straightforward Uncertainty-Aware Network (UANet) to alleviate this problem.
Results demonstrate that the proposed UANet outperforms other state-of-the-art algorithms by a large margin.
arXiv Detail & Related papers (2023-07-23T12:42:15Z) - Semi-supervised Learning from Street-View Images and OpenStreetMap for
Automatic Building Height Estimation [59.6553058160943]
We propose a semi-supervised learning (SSL) method of automatically estimating building height from Mapillary SVI and OpenStreetMap data.
The proposed method leads to a clear performance boosting in estimating building heights with a Mean Absolute Error (MAE) around 2.1 meters.
The preliminary result is promising and motivates our future work in scaling up the proposed method based on low-cost VGI data.
arXiv Detail & Related papers (2023-07-05T18:16:30Z) - Building Footprint Extraction with Graph Convolutional Network [20.335884170850193]
Building footprint information is an essential ingredient for 3-D reconstruction of urban models.
Recent developments in deep convolutional neural networks (DCNNs) have enabled accurate pixel-level labeling tasks.
In this work, we have proposed a end-to-end framework to overcome this issue, which uses the graph convolutional network (GCN) for building footprint extraction task.
arXiv Detail & Related papers (2023-05-08T06:50:05Z) - Elevation Estimation-Driven Building 3D Reconstruction from Single-View
Remote Sensing Imagery [20.001807614214922]
Building 3D reconstruction from remote sensing images has a wide range of applications in smart cities, photogrammetry and other fields.
We propose an efficient DSM estimation-driven reconstruction framework (Building3D) to reconstruct 3D building models from the input single-view remote sensing image.
Our Building3D is rooted in the SFFDE network for building elevation prediction, synchronized with a building extraction network for building masks, and then sequentially performs point cloud reconstruction, surface reconstruction (or CityGML model reconstruction)
arXiv Detail & Related papers (2023-01-11T17:20:30Z) - Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge
Transfer [58.6106391721944]
Cross-city knowledge has shown its promise, where the model learned from data-sufficient cities is leveraged to benefit the learning process of data-scarce cities.
We propose a model-agnostic few-shot learning framework for S-temporal graph called ST-GFSL.
We conduct comprehensive experiments on four traffic speed prediction benchmarks and the results demonstrate the effectiveness of ST-GFSL compared with state-of-the-art methods.
arXiv Detail & Related papers (2022-05-27T12:46:52Z) - PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object
Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving.
Current approaches suffer from sparse and partial point clouds of distant and occluded objects.
In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z) - Mix Dimension in Poincar\'{e} Geometry for 3D Skeleton-based Action
Recognition [57.98278794950759]
Graph Convolutional Networks (GCNs) have already demonstrated their powerful ability to model the irregular data.
We present a novel spatial-temporal GCN architecture which is defined via the Poincar'e geometry.
We evaluate our method on two current largest scale 3D datasets.
arXiv Detail & Related papers (2020-07-30T18:23:18Z) - Local Grid Rendering Networks for 3D Object Detection in Point Clouds [98.02655863113154]
CNNs are powerful but it would be computationally costly to directly apply convolutions on point data after voxelizing the entire point clouds to a dense regular 3D grid.
We propose a novel and principled Local Grid Rendering (LGR) operation to render the small neighborhood of a subset of input points into a low-resolution 3D grid independently.
We validate LGR-Net for 3D object detection on the challenging ScanNet and SUN RGB-D datasets.
arXiv Detail & Related papers (2020-07-04T13:57:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.