Transformer Based Building Boundary Reconstruction using Attraction Field Maps
- URL: http://arxiv.org/abs/2507.17038v1
- Date: Tue, 22 Jul 2025 21:53:03 GMT
- Title: Transformer Based Building Boundary Reconstruction using Attraction Field Maps
- Authors: Muhammad Kamran, Mohammad Moein Sheikholeslami, Andreas Wichmann, Gunho Sohn
- Abstract summary: This paper introduces a novel deep learning methodology leveraging Graph Convolutional Networks (GCNs) to address these challenges in building footprint reconstruction. Our model, Decoupled-PolyGCN, outperforms existing methods by 6% in AP and 10% in AR, demonstrating its ability to deliver accurate and regularized building footprints.
- Score: 0.3749861135832072
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, the number of remote satellites orbiting the Earth has grown significantly, streaming vast amounts of high-resolution visual data to support diverse applications across civil, public, and military domains. Among these applications, the generation and updating of spatial maps of the built environment have become critical due to the extensive coverage and detailed imagery provided by satellites. However, reconstructing spatial maps from satellite imagery is a complex computer vision task, requiring the creation of high-level object representations, such as primitives, to accurately capture the built environment. While the past decade has witnessed remarkable advancements in object detection and representation using visual data, primitives-based object representation remains a persistent challenge in computer vision. Consequently, high-quality spatial maps often rely on labor-intensive and manual processes. This paper introduces a novel deep learning methodology leveraging Graph Convolutional Networks (GCNs) to address these challenges in building footprint reconstruction. The proposed approach enhances performance by incorporating geometric regularity into building boundaries, integrating multi-scale and multi-resolution features, and embedding Attraction Field Maps into the network. These innovations provide a scalable and precise solution for automated building footprint extraction from a single satellite image, paving the way for impactful applications in urban planning, disaster management, and large-scale spatial analysis. Our model, Decoupled-PolyGCN, outperforms existing methods by 6% in AP and 10% in AR, demonstrating its ability to deliver accurate and regularized building footprints across diverse and challenging scenarios.
Related papers
- On the use of Graphs for Satellite Image Time Series [3.2623791881739033]
This paper is an effort to examine the integration of graph-based methods in remote-sensing analysis. It aims to present a versatile graph-based pipeline to tackle SITS analysis. The paper includes a review and two case studies, which highlight the potential of graph-based approaches for land cover mapping and water forecasting datasets.
arXiv Detail & Related papers (2025-05-22T13:53:36Z)
- Data Augmentation and Resolution Enhancement using GANs and Diffusion Models for Tree Segmentation [49.13393683126712]
Urban forests play a key role in enhancing environmental quality and supporting biodiversity in cities. Accurately detecting trees is challenging due to complex landscapes and the variability in image resolution caused by different satellite sensors or UAV flight altitudes. We propose a novel pipeline that integrates domain adaptation with GANs and Diffusion models to enhance the quality of low-resolution aerial images.
arXiv Detail & Related papers (2025-05-21T03:57:10Z)
- AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis [57.249817395828174]
We propose a scalable framework combining pseudo-synthetic renderings from 3D city-wide meshes with real, ground-level crowd-sourced images. The pseudo-synthetic data simulates a wide range of aerial viewpoints, while the real, crowd-sourced images help improve visual fidelity for ground-level images. Using this hybrid dataset, we fine-tune several state-of-the-art algorithms and achieve significant improvements on real-world, zero-shot aerial-ground tasks.
arXiv Detail & Related papers (2025-04-17T17:57:05Z)
- Can Location Embeddings Enhance Super-Resolution of Satellite Imagery? [2.3020018305241337]
Publicly available satellite imagery, such as Sentinel-2, often lacks the spatial resolution required for accurate analysis of remote sensing tasks. We propose a novel super-resolution framework that enhances generalization by incorporating geographic context through location embeddings. We demonstrate the effectiveness of our method on the building segmentation task, showing significant improvements over state-of-the-art methods.
arXiv Detail & Related papers (2025-01-27T08:16:54Z)
- AerialGo: Walking-through City View Generation from Aerial Perspectives [48.53976414257845]
AerialGo is a framework that generates realistic walking-through city views from aerial images. By conditioning ground-view synthesis on accessible aerial data, AerialGo bypasses the privacy risks inherent in ground-level imagery. Experiments show that AerialGo significantly enhances ground-level realism and structural coherence.
arXiv Detail & Related papers (2024-11-29T08:14:07Z)
- Cross Pseudo Supervision Framework for Sparsely Labelled Geospatial Images [0.0]
Land Use Land Cover (LULC) mapping is a vital tool for urban and resource planning.
This study introduces a semi-supervised segmentation model for LULC prediction using high-resolution satellite images.
We propose a modified Cross Pseudo Supervision framework to train image segmentation models on sparsely labelled data.
arXiv Detail & Related papers (2024-08-05T11:14:23Z)
- A General Purpose Neural Architecture for Geospatial Systems [142.43454584836812]
We present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias.
We envision how such a model may facilitate cooperation between members of the community.
arXiv Detail & Related papers (2022-11-04T09:58:57Z)
- Tracking Urbanization in Developing Regions with Remote Sensing Spatial-Temporal Super-Resolution [82.50301442891602]
We propose a pipeline that leverages a single high-resolution image and a time series of publicly available low-resolution images.
Our method achieves significant improvement in comparison to baselines using single image super-resolution.
arXiv Detail & Related papers (2022-04-04T17:21:20Z)
- Occupancy Anticipation for Efficient Exploration and Navigation [97.17517060585875]
We propose occupancy anticipation, where the agent uses its egocentric RGB-D observations to infer the occupancy state beyond the visible regions.
By exploiting context in both the egocentric views and top-down maps our model successfully anticipates a broader map of the environment.
Our approach is the winning entry in the 2020 Habitat PointNav Challenge.
arXiv Detail & Related papers (2020-08-21T03:16:51Z)
- Weakly Supervised Domain Adaptation for Built-up Region Segmentation in Aerial and Satellite Imagery [3.8508264614798517]
Built-up area estimation is an important component in understanding the human impact on the environment, the effect of public policy, and general urban population analysis.
The diverse nature of aerial and satellite imagery, and the lack of labeled data covering this diversity, make it difficult for machine learning algorithms to generalize.
This paper proposes a novel domain adaptation algorithm to handle the challenges posed by the satellite and aerial imagery.
arXiv Detail & Related papers (2020-07-05T10:05:01Z)
- Boundary Regularized Building Footprint Extraction From Satellite Images Using Deep Neural Network [6.371173732947292]
We propose a novel deep neural network that jointly detects building instances and regularizes noisy building boundary shapes from a single satellite image.
Our model performs object localization, recognition, semantic labelling, and geometric shape extraction simultaneously.
arXiv Detail & Related papers (2020-06-23T17:24:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.