BuildSeg: A General Framework for the Segmentation of Buildings
- URL: http://arxiv.org/abs/2301.06190v1
- Date: Sun, 15 Jan 2023 21:09:00 GMT
- Title: BuildSeg: A General Framework for the Segmentation of Buildings
- Authors: Lei Li, Tianfang Zhang, Stefan Oehmcke, Fabian Gieseke, Christian Igel
- Abstract summary: Building segmentation from aerial images and 3D laser scanning (LiDAR) is a challenging task due to the diversity of backgrounds, building textures, and image quality.
We propose a general framework termed emphBuildSeg employing a generic approach that can be quickly applied to segment buildings.
- Score: 19.296282254565885
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Building segmentation from aerial images and 3D laser scanning (LiDAR) is a
challenging task due to the diversity of backgrounds, building textures, and
image quality. While current research using different types of convolutional
and transformer networks has considerably improved the performance on this
task, even more accurate segmentation methods for buildings are desirable for
applications such as automatic mapping. In this study, we propose a general
framework termed \emph{BuildSeg} employing a generic approach that can be
quickly applied to segment buildings. Different data sources were combined to
increase generalization performance. The approach yields good results for
different data sources as shown by experiments on high-resolution
multi-spectral and LiDAR imagery of cities in Norway, Denmark and France. We
applied ConvNeXt and SegFormer based models on the high resolution aerial image
dataset from the MapAI-competition. The methods achieved an IOU of 0.7902 and a
boundary IOU of 0.6185. We used post-processing to account for the rectangular
shape of the objects. This increased the boundary IOU from 0.6185 to 0.6189.
Related papers
- MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images [4.2623421577291225]
We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR localisation.
Our framework achieves an average of 0.72 IoU (segmentation) and 0.90 F1 score (detection) across all modalities.
arXiv Detail & Related papers (2024-07-19T17:21:53Z) - Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search [49.81353382211113]
We address the challenge of integrating multi-head self-attention into high resolution representation CNNs efficiently.
We develop a multi-target multi-branch supernet method, which fully utilizes the advantages of high-resolution features.
We present a series of model via Hybrid Convolutional-Transformer Architecture Search (HyCTAS) method that searched for the best hybrid combination of light-weight convolution layers and memory-efficient self-attention layers.
arXiv Detail & Related papers (2024-03-15T15:47:54Z) - Simple and Effective Synthesis of Indoor 3D Scenes [78.95697556834536]
We study the problem of immersive 3D indoor scenes from one or more images.
Our aim is to generate high-resolution images and videos from novel viewpoints.
We propose an image-to-image GAN that maps directly from reprojections of incomplete point clouds to full high-resolution RGB-D images.
arXiv Detail & Related papers (2022-04-06T17:54:46Z) - Panoptic SwiftNet: Pyramidal Fusion for Real-time Panoptic Segmentation [0.0]
Many applications require fast inference over large input resolutions on affordable or even embedded hardware.
We propose to achieve this goal by trading off backbone capacity for multi-scale feature extraction.
We present panoptic experiments on Cityscapes, Vistas, COCO and the BSB-Aerial dataset.
arXiv Detail & Related papers (2022-03-15T13:47:40Z) - Boosting Robustness of Image Matting with Context Assembling and Strong
Data Augmentation [83.31087402305306]
robustness to trimaps and generalization to images from different domains is still under-explored.
We propose an image matting method which achieves higher robustness (RMat) via multilevel context assembling and strong data augmentation targeting matting.
arXiv Detail & Related papers (2022-01-18T11:45:17Z) - Sci-Net: a Scale Invariant Model for Building Detection from Aerial
Images [0.0]
We propose a Scale-invariant neural network (Sci-Net) that is able to segment buildings present in aerial images at different spatial resolutions.
Specifically, we modified the U-Net architecture and fused it with dense Atrous Spatial Pyramid Pooling (ASPP) to extract fine-grained multi-scale representations.
arXiv Detail & Related papers (2021-11-12T16:45:20Z) - Continental-Scale Building Detection from High Resolution Satellite
Imagery [5.56205296867374]
We study variations in architecture, loss functions, regularization, pre-training, self-training and post-processing that increase instance segmentation performance.
Experiments were carried out using a dataset of 100k satellite images across Africa containing 1.75M manually labelled building instances.
We report novel methods for improving performance of building detection with this type of model, including the use of mixup.
arXiv Detail & Related papers (2021-07-26T15:48:14Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - Object Detection in Aerial Images: A Large-Scale Benchmark and
Challenges [124.48654341780431]
We present a large-scale dataset of Object deTection in Aerial images (DOTA) and comprehensive baselines for ODAI.
The proposed DOTA dataset contains 1,793,658 object instances of 18 categories of oriented-bounding-box annotations collected from 11,268 aerial images.
We build baselines covering 10 state-of-the-art algorithms with over 70 configurations, where the speed and accuracy performances of each model have been evaluated.
arXiv Detail & Related papers (2021-02-24T11:20:55Z) - Contextual Pyramid Attention Network for Building Segmentation in Aerial
Imagery [12.241693880896348]
Building extraction from aerial images has several applications in problems such as urban planning, change detection, and disaster management.
We propose to improve building segmentation of different sizes by capturing long-range dependencies using contextual pyramid attention (CPA)
Our method improves 1.8 points over current state-of-the-art methods and 12.6 points higher than existing baselines without any post-processing.
arXiv Detail & Related papers (2020-04-15T11:36:26Z) - Image Matching across Wide Baselines: From Paper to Practice [80.9424750998559]
We introduce a comprehensive benchmark for local features and robust estimation algorithms.
Our pipeline's modular structure allows easy integration, configuration, and combination of different methods.
We show that with proper settings, classical solutions may still outperform the perceived state of the art.
arXiv Detail & Related papers (2020-03-03T15:20:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.