A Quantum-assisted Attention U-Net for Building Segmentation over Tunis using Sentinel-1 Data
- URL: http://arxiv.org/abs/2507.13852v1
- Date: Fri, 18 Jul 2025 12:16:04 GMT
- Title: A Quantum-assisted Attention U-Net for Building Segmentation over Tunis using Sentinel-1 Data
- Authors: Luigi Russo, Francesco Mauro, Babak Memar, Alessandro Sebastianelli, Silvia Liberata Ullo, Paolo Gamba,
- Abstract summary: Building segmentation in urban areas is essential in fields such as urban planning, disaster response, and population mapping.<n>Yet accurately segmenting buildings in dense urban regions presents challenges due to the large size and high resolution of satellite images.<n>This study investigates the use of a Quanvolutional pre-processing to enhance the capability of the Attention U-Net model in the building segmentation.
- Score: 39.039210749657194
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Building segmentation in urban areas is essential in fields such as urban planning, disaster response, and population mapping. Yet accurately segmenting buildings in dense urban regions presents challenges due to the large size and high resolution of satellite images. This study investigates the use of a Quanvolutional pre-processing to enhance the capability of the Attention U-Net model in the building segmentation. Specifically, this paper focuses on the urban landscape of Tunis, utilizing Sentinel-1 Synthetic Aperture Radar (SAR) imagery. In this work, Quanvolution was used to extract more informative feature maps that capture essential structural details in radar imagery, proving beneficial for accurate building segmentation. Preliminary results indicate that proposed methodology achieves comparable test accuracy to the standard Attention U-Net model while significantly reducing network parameters. This result aligns with findings from previous works, confirming that Quanvolution not only maintains model accuracy but also increases computational efficiency. These promising outcomes highlight the potential of quantum-assisted Deep Learning frameworks for large-scale building segmentation in urban environments.
Related papers
- Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Point Cloud Segmentation Using Transfer Learning with RandLA-Net: A Case
Study on Urban Areas [0.5242869847419834]
This paper presents the application of RandLA-Net, a state-of-the-art neural network architecture, for the 3D segmentation of large-scale point cloud data in urban areas.
The study focuses on three major Chinese cities, namely Chengdu, Jiaoda, and Shenzhen, leveraging their unique characteristics to enhance segmentation performance.
arXiv Detail & Related papers (2023-12-19T06:13:58Z) - Innovative Horizons in Aerial Imagery: LSKNet Meets DiffusionDet for
Advanced Object Detection [55.2480439325792]
We present an in-depth evaluation of an object detection model that integrates the LSKNet backbone with the DiffusionDet head.
The proposed model achieves a mean average precision (MAP) of approximately 45.7%, which is a significant improvement.
This advancement underscores the effectiveness of the proposed modifications and sets a new benchmark in aerial image analysis.
arXiv Detail & Related papers (2023-11-21T19:49:13Z) - TFNet: Tuning Fork Network with Neighborhood Pixel Aggregation for
Improved Building Footprint Extraction [11.845097068829551]
We propose a novel tuning Fork Network (TFNet) design for deep semantic segmentation.
The TFNet design is coupled with a novel methodology of incorporating neighborhood information at the tile boundaries during the training process.
For performance comparisons, we utilize the SpaceNet2 and WHU datasets, as well as a dataset from an area in Lahore, Pakistan that captures closely connected buildings.
arXiv Detail & Related papers (2023-11-05T10:52:16Z) - Cross-City Matters: A Multimodal Remote Sensing Benchmark Dataset for
Cross-City Semantic Segmentation using High-Resolution Domain Adaptation
Networks [82.82866901799565]
We build a new set of multimodal remote sensing benchmark datasets (including hyperspectral, multispectral, SAR) for the study purpose of the cross-city semantic segmentation task.
Beyond the single city, we propose a high-resolution domain adaptation network, HighDAN, to promote the AI model's generalization ability from the multi-city environments.
HighDAN is capable of retaining the spatially topological structure of the studied urban scene well in a parallel high-to-low resolution fusion fashion.
arXiv Detail & Related papers (2023-09-26T23:55:39Z) - Performance Analysis of Various EfficientNet Based U-Net++ Architecture
for Automatic Building Extraction from High Resolution Satellite Images [0.0]
Building extraction heavily relies on semantic segmentation of high-resolution remote sensing imagery.
Various efficientNet backbone based U-Net++ has been proposed in this study.
According on the experimental findings, the suggested model significantly outperforms previous cutting-edge approaches.
arXiv Detail & Related papers (2023-09-05T18:14:14Z) - Building Extraction from Remote Sensing Images via an Uncertainty-Aware
Network [18.365220543556113]
Building extraction plays an essential role in many applications, such as city planning and urban dynamic monitoring.
We propose a novel and straightforward Uncertainty-Aware Network (UANet) to alleviate this problem.
Results demonstrate that the proposed UANet outperforms other state-of-the-art algorithms by a large margin.
arXiv Detail & Related papers (2023-07-23T12:42:15Z) - CG-Net: Conditional GIS-aware Network for Individual Building
Segmentation in VHR SAR Images [25.87229252642239]
This paper addresses the issue of individual building segmentation from a single VHR SAR image in large-scale urban areas.
We introduce building footprints from GIS data as complementary information and propose a novel conditional GIS-aware network (CG-Net)
The proposed model learns multi-level visual features and employs building footprints to normalize the features for predicting building masks in the SAR image.
arXiv Detail & Related papers (2020-11-17T01:52:22Z) - RescueNet: Joint Building Segmentation and Damage Assessment from
Satellite Imagery [83.49145695899388]
RescueNet is a unified model that can simultaneously segment buildings and assess the damage levels to individual buildings and can be trained end-to-end.
RescueNet is tested on the large scale and diverse xBD dataset and achieves significantly better building segmentation and damage classification performance than previous methods.
arXiv Detail & Related papers (2020-04-15T19:52:09Z) - Real-Time High-Performance Semantic Image Segmentation of Urban Street
Scenes [98.65457534223539]
We propose a real-time high-performance DCNN-based method for robust semantic segmentation of urban street scenes.
The proposed method achieves the accuracy of 73.6% and 68.0% mean Intersection over Union (mIoU) with the inference speed of 51.0 fps and 39.3 fps.
arXiv Detail & Related papers (2020-03-11T08:45:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.