Cross-Region Building Counting in Satellite Imagery using Counting
Consistency
- URL: http://arxiv.org/abs/2110.13558v2
- Date: Sun, 13 Aug 2023 07:16:43 GMT
- Title: Cross-Region Building Counting in Satellite Imagery using Counting
Consistency
- Authors: Muaaz Zakria, Hamza Rawal, Waqas Sultani, Mohsen Ali
- Abstract summary: Estimating the number of buildings in any geographical region is a vital component of urban analysis, disaster management, and public policy decision.
Deep learning methods for building localization and counting in satellite imagery, can serve as a viable and cheap alternative.
However, these algorithms suffer performance degradation when applied to the regions on which they have not been trained.
- Score: 8.732274235941974
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimating the number of buildings in any geographical region is a vital
component of urban analysis, disaster management, and public policy decision.
Deep learning methods for building localization and counting in satellite
imagery, can serve as a viable and cheap alternative. However, these algorithms
suffer performance degradation when applied to the regions on which they have
not been trained. Current large datasets mostly cover the developed regions and
collecting such datasets for every region is a costly, time-consuming, and
difficult endeavor. In this paper, we propose an unsupervised domain adaptation
method for counting buildings where we use a labeled source domain (developed
regions) and adapt the trained model on an unlabeled target domain (developing
regions). We initially align distribution maps across domains by aligning the
output space distribution through adversarial loss. We then exploit counting
consistency constraints, within-image count consistency, and across-image count
consistency, to decrease the domain shift. Within-image consistency enforces
that building count in the whole image should be greater than or equal to count
in any of its sub-image. Across-image consistency constraint enforces that if
an image contains considerably more buildings than the other image, then their
sub-images shall also have the same order. These two constraints encourage the
behavior to be consistent across and within the images, regardless of the
scale. To evaluate the performance of our proposed approach, we collected and
annotated a large-scale dataset consisting of challenging South Asian regions
having higher building densities and irregular structures as compared to
existing datasets. We perform extensive experiments to verify the efficacy of
our approach and report improvements of approximately 7% to 20% over the
competitive baseline methods.
Related papers
- Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data [66.49494950674402]
We leverage emerging text-to-image generative models in creating large-scale synthetic supervision for the task of damage assessment from aerial images.
We build an efficient and easily scalable pipeline to generate thousands of post-disaster images from low-resource domains.
We validate the strength of our proposed framework under cross-geography domain transfer setting from xBD and SKAI images in both single-source and multi-source settings.
arXiv Detail & Related papers (2024-05-22T16:07:05Z) - Getting it Right: Improving Spatial Consistency in Text-to-Image Models [103.52640413616436]
One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt.
We create SPRIGHT, the first spatially-focused, large scale dataset, by re-captioning 6 million images from 4 widely used vision datasets.
We attain state-of-the-art on T2I-CompBench with a spatial score of 0.2133, by fine-tuning on 500 images.
arXiv Detail & Related papers (2024-04-01T15:55:25Z) - Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning [50.88504784466931]
Multi-task dense prediction involves semantic segmentation, depth estimation, and surface normal estimation.
Existing solutions typically rely on learning global image representations for global cross-task image matching.
Our proposal involves modeling region-wise representations using Gaussian Distributions.
arXiv Detail & Related papers (2024-03-15T12:41:30Z) - Progressive Feature Self-reinforcement for Weakly Supervised Semantic
Segmentation [55.69128107473125]
We propose a single-stage approach for Weakly Supervised Semantic (WSSS) with image-level labels.
We adaptively partition the image content into deterministic regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing.
Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.
arXiv Detail & Related papers (2023-12-14T13:21:52Z) - Self-supervised Domain-agnostic Domain Adaptation for Satellite Images [18.151134198549574]
We propose an self-supervised domain-agnostic domain adaptation (SS(DA)2) method to perform domain adaptation without such a domain definition.
We first design a contrastive generative adversarial loss to train a generative network to perform image-to-image translation between any two satellite image patches.
Then, we improve the generalizability of the downstream models by augmenting the training data with different testing spectral characteristics.
arXiv Detail & Related papers (2023-09-20T07:37:23Z) - Building Footprint Extraction in Dense Areas using Super Resolution and
Frame Field Learning [1.949927790632678]
Super resolution is employed to enhance the spatial resolution of aerial image, allowing for finer details to be captured.
This enhanced imagery serves as input to a multitask learning module, which consists of a segmentation head and a frame field learning head.
Our model is supervised by adaptive loss weighting, enabling extraction of sharp edges and fine-grained polygons.
arXiv Detail & Related papers (2023-09-04T15:15:34Z) - Activation Regression for Continuous Domain Generalization with
Applications to Crop Classification [48.795866501365694]
Geographic variance in satellite imagery impacts the ability of machine learning models to generalise to new regions.
We model geographic generalisation in medium resolution Landsat-8 satellite imagery as a continuous domain adaptation problem.
We develop a dataset spatially distributed across the entire continental United States.
arXiv Detail & Related papers (2022-04-14T15:41:39Z) - An Efficient Method for the Classification of Croplands in Scarce-Label
Regions [0.0]
Two of the main challenges for cropland classification by satellite time-series images are insufficient ground-truth data and inaccessibility of high-quality hyperspectral images for under-developed areas.
Unlabeled medium-resolution satellite images are abundant, but how to benefit from them is an open question.
We will show how to leverage their potential for cropland classification using self-supervised tasks.
arXiv Detail & Related papers (2021-03-17T12:10:11Z) - Weakly Supervised Domain Adaptation for Built-up Region Segmentation in
Aerial and Satellite Imagery [3.8508264614798517]
Built-up area estimation is an important component in understanding the human impact on the environment, the effect of public policy, and general urban population analysis.
The diverse nature of aerial and satellite imagery and lack of labeled data covering this diversity makes machine learning algorithms difficult to generalize.
This paper proposes a novel domain adaptation algorithm to handle the challenges posed by the satellite and aerial imagery.
arXiv Detail & Related papers (2020-07-05T10:05:01Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.