TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for
Improved Building Footprint Extraction
- URL: http://arxiv.org/abs/2311.02617v1
- Date: Sun, 5 Nov 2023 10:52:16 GMT
- Title: TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for
Improved Building Footprint Extraction
- Authors: Muhammad Ahmad Waseem, Muhammad Tahir, Zubair Khalid, and Momin Uppal
- Abstract summary: We propose a novel tuning Fork Network (TFNet) design for deep semantic segmentation.
The TFNet design is coupled with a novel methodology of incorporating neighborhood information at the tile boundaries during the training process.
For performance comparisons, we utilize the SpaceNet2 and WHU datasets, as well as a dataset from an area in Lahore, Pakistan that captures closely connected buildings.
- Score: 11.845097068829551
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper considers the problem of extracting building footprints from
satellite imagery -- a task that is critical for many urban planning and
decision-making applications. While recent advancements in deep learning have
made great strides in automated detection of building footprints,
state-of-the-art methods available in existing literature often generate
erroneous results for areas with densely connected buildings. Moreover, these
methods do not incorporate the context of neighborhood images during training
thus generally resulting in poor performance at image boundaries. In light of
these gaps, we propose a novel Tuning Fork Network (TFNet) design for deep
semantic segmentation that not only performs well for widely-spaced building
but also has good performance for buildings that are closely packed together.
The novelty of TFNet architecture lies in a a single encoder followed by two
parallel decoders to separately reconstruct the building footprint and the
building edge. In addition, the TFNet design is coupled with a novel
methodology of incorporating neighborhood information at the tile boundaries
during the training process. This methodology further improves performance,
especially at the tile boundaries. For performance comparisons, we utilize the
SpaceNet2 and WHU datasets, as well as a dataset from an area in Lahore,
Pakistan that captures closely connected buildings. For all three datasets, the
proposed methodology is found to significantly outperform benchmark methods.
Related papers
- Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans [1.0436971860292366]
We propose two novel pixel-wise segmentation methods based on the MDA-Unet and MACU-Net architectures.
The proposed methods are compared with two other state-of-the-art techniques and several benchmark datasets.
On the commonly used CubiCasa benchmark dataset, our methods have achieved the mean F1 score of 0.86 over five examined classes.
arXiv Detail & Related papers (2024-08-02T18:36:45Z) - ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic
Reconstruction [62.599588577671796]
We propose an online 3D semantic segmentation method that incrementally reconstructs a 3D semantic map from a stream of RGB-D frames.
Unlike offline methods, ours is directly applicable to scenarios with real-time constraints, such as robotics or mixed reality.
arXiv Detail & Related papers (2023-11-29T20:30:18Z) - Building Extraction from Remote Sensing Images via an Uncertainty-Aware
Network [18.365220543556113]
Building extraction plays an essential role in many applications, such as city planning and urban dynamic monitoring.
We propose a novel and straightforward Uncertainty-Aware Network (UANet) to alleviate this problem.
Results demonstrate that the proposed UANet outperforms other state-of-the-art algorithms by a large margin.
arXiv Detail & Related papers (2023-07-23T12:42:15Z) - Building Footprint Extraction with Graph Convolutional Network [20.335884170850193]
Building footprint information is an essential ingredient for 3-D reconstruction of urban models.
Recent developments in deep convolutional neural networks (DCNNs) have enabled accurate pixel-level labeling tasks.
In this work, we have proposed a end-to-end framework to overcome this issue, which uses the graph convolutional network (GCN) for building footprint extraction task.
arXiv Detail & Related papers (2023-05-08T06:50:05Z) - MultiScale Probability Map guided Index Pooling with Attention-based
learning for Road and Building Segmentation [18.838213902873616]
We propose a novel attention-aware segmentation framework, Multi-Scale Supervised Dilated Multiple-Path Attention Network (MSSDMPA-Net)
MSSDMPA-Net is equipped with two new modules Dynamic Attention Map Guided Index Pooling (DAMIP) and Dynamic Attention Map Guided Spatial and Channel Attention (DAMSCA) to precisely extract the building footprints and road maps from remotely sensed images.
arXiv Detail & Related papers (2023-02-18T19:57:25Z) - Neural 3D Scene Reconstruction with the Manhattan-world Assumption [58.90559966227361]
This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view images.
Planar constraints can be conveniently integrated into the recent implicit neural representation-based reconstruction methods.
The proposed method outperforms previous methods by a large margin on 3D reconstruction quality.
arXiv Detail & Related papers (2022-05-05T17:59:55Z) - Learning Local Displacements for Point Cloud Completion [93.54286830844134]
We propose a novel approach aimed at object and semantic scene completion from a partial scan represented as a 3D point cloud.
Our architecture relies on three novel layers that are used successively within an encoder-decoder structure.
We evaluate both architectures on object and indoor scene completion tasks, achieving state-of-the-art performance.
arXiv Detail & Related papers (2022-03-30T18:31:37Z) - BuildingNet: Learning to Label 3D Buildings [19.641000866952815]
BuildingNet: (a) large-scale 3D building models whose exteriors consistently labeled, (b) a neural network that labels building analyzing and structural relations of their geometric primitives.
The dataset covers categories, such as houses, churches, skyscrapers, town halls and castles.
arXiv Detail & Related papers (2021-10-11T01:45:26Z) - Boundary-Aware Segmentation Network for Mobile and Web Applications [60.815545591314915]
Boundary-Aware Network (BASNet) is integrated with a predict-refine architecture and a hybrid loss for highly accurate image segmentation.
BASNet runs at over 70 fps on a single GPU which benefits many potential real applications.
Based on BASNet, we further developed two (close to) commercial applications: AR COPY & PASTE, in which BASNet is augmented reality for "COPY" and "PASTING" real-world objects, and OBJECT CUT, which is a web-based tool for automatic object background removal.
arXiv Detail & Related papers (2021-01-12T19:20:26Z) - Structured Convolutions for Efficient Neural Network Design [65.36569572213027]
We tackle model efficiency by exploiting redundancy in the textitimplicit structure of the building blocks of convolutional neural networks.
We show how this decomposition can be applied to 2D and 3D kernels as well as the fully-connected layers.
arXiv Detail & Related papers (2020-08-06T04:38:38Z) - Real-Time High-Performance Semantic Image Segmentation of Urban Street
Scenes [98.65457534223539]
We propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes.
The proposed method achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) with the inference speed of 51.0 fps and 39.3 fps.
arXiv Detail & Related papers (2020-03-11T08:45:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.