Sensing population distribution from satellite imagery via deep
learning: model selection, neighboring effect, and systematic biases
- URL: http://arxiv.org/abs/2103.02155v1
- Date: Wed, 3 Mar 2021 03:40:24 GMT
- Title: Sensing population distribution from satellite imagery via deep
learning: model selection, neighboring effect, and systematic biases
- Authors: Xiao Huang, Di Zhu, Fan Zhang, Tao Liu, Xiao Li, Lei Zou
- Abstract summary: This study marks the first attempt to cross-compare performances of popular state-of-the-art deep learning models in estimating population distribution from remote sensing images.
DenseNet outperforms the other three models, while VGG has the worst performances in all evaluating metrics under all selected neighboring scenarios.
There exists a notable, universal bias that all selected deep learning models tend to overestimate sparsely populated image patches and underestimate densely populated image patches, regardless of neighboring sizes.
- Score: 16.82118960055405
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The rapid development of remote sensing techniques provides rich,
large-coverage, and high-temporal information of the ground, which can be
coupled with the emerging deep learning approaches that enable latent features
and hidden geographical patterns to be extracted. This study marks the first
attempt to cross-compare performances of popular state-of-the-art deep learning
models in estimating population distribution from remote sensing images,
investigate the contribution of neighboring effect, and explore the potential
systematic population estimation biases. We conduct an end-to-end training of
four popular deep learning architectures, i.e., VGG, ResNet, Xception, and
DenseNet, by establishing a mapping between Sentinel-2 image patches and their
corresponding population count from the LandScan population grid. The results
reveal that DenseNet outperforms the other three models, while VGG has the
worst performances in all evaluating metrics under all selected neighboring
scenarios. As for the neighboring effect, contradicting existing studies, our
results suggest that the increase of neighboring sizes leads to reduced
population estimation performance, which is found universal for all four
selected models in all evaluating metrics. In addition, there exists a notable,
universal bias that all selected deep learning models tend to overestimate
sparsely populated image patches and underestimate densely populated image
patches, regardless of neighboring sizes. The methodological, experimental, and
contextual knowledge this study provides is expected to benefit a wide range of
future studies that estimate population distribution via remote sensing
imagery.
Related papers
- Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery [0.0]
In the rise of climate change, land cover mapping has become such an urgent need in environmental monitoring.
This research implemented a semantic segmentation method such as Unet, Linknet, FPN, and PSPnet for categorizing vegetation, water, and others.
The LinkNet model obtained high accuracy in IoU at 0.92 in all datasets, which is comparable with other mentioned techniques.
arXiv Detail & Related papers (2024-06-20T11:40:12Z) - Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data [47.18802526899955]
Crowd-sourced perceptual datasets have emerged, with no images shared between triplets, making ranking infeasible.
We statistically model the underlying decision-making process during 2AFC experiments using a binomial distribution.
We calculate meaningful and well-founded metrics for the distance model, beyond the mere prediction accuracy as percentage agreement.
arXiv Detail & Related papers (2024-03-15T15:21:04Z) - Urban Region Embedding via Multi-View Contrastive Prediction [22.164358462563996]
We form a new pipeline to learn consistent representations across varying views.
Our model outperforms state-of-the-art baseline methods significantly in urban region representation learning.
arXiv Detail & Related papers (2023-12-15T10:53:09Z) - Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose
Estimation [38.97427474379367]
We introduce a denoising scheme to generate reliable pseudo-heatmaps as targets for learning from unlabeled data.
We select the learning targets from these pseudo-heatmaps guided by the estimated cross-student uncertainty.
Our results show that our model outperforms previous state-of-the-art semi-supervised pose estimators.
arXiv Detail & Related papers (2023-09-29T19:17:30Z) - Granularity at Scale: Estimating Neighborhood Socioeconomic Indicators
from High-Resolution Orthographic Imagery and Hybrid Learning [1.8369448205408005]
Overhead images can help fill in the gaps where community information is sparse.
Recent advancements in machine learning and computer vision have made it possible to quickly extract features from and detect patterns in image data.
In this work, we explore how well two approaches, a supervised convolutional neural network and semi-supervised clustering can estimate population density, median household income, and educational attainment.
arXiv Detail & Related papers (2023-09-28T19:30:26Z) - PANet: Perspective-Aware Network with Dynamic Receptive Fields and
Self-Distilling Supervision for Crowd Counting [63.84828478688975]
We propose a novel perspective-aware approach called PANet to address the perspective problem.
Based on the observation that the size of the objects varies greatly in one image due to the perspective effect, we propose the dynamic receptive fields (DRF) framework.
The framework is able to adjust the receptive field by the dilated convolution parameters according to the input image, which helps the model to extract more discriminative features for each local region.
arXiv Detail & Related papers (2021-10-31T04:43:05Z) - Classification of Urban Morphology with Deep Learning: Application on
Urban Vitality [0.0]
We propose a deep learning-based technique to automatically classify road networks into four classes on a visual basis.
Nine cities around the world are selected as the study areas and their road networks are acquired from OpenStreetMap.
Latent subgroups among the cities are uncovered through a clustering on the percentage of each road network category.
An advanced tree-based regression model is for the first time designated to establish the relationship between morphological indices and vitality indicators.
arXiv Detail & Related papers (2021-05-07T08:53:31Z) - Anomaly Detection on Attributed Networks via Contrastive Self-Supervised
Learning [50.24174211654775]
We present a novel contrastive self-supervised learning framework for anomaly detection on attributed networks.
Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair.
A graph neural network-based contrastive learning model is proposed to learn informative embedding from high-dimensional attributes and local structure.
arXiv Detail & Related papers (2021-02-27T03:17:20Z) - Shallow Feature Based Dense Attention Network for Crowd Counting [103.67446852449551]
We propose a Shallow feature based Dense Attention Network (SDANet) for crowd counting from still images.
Our method outperforms other existing methods by a large margin, as is evident from a remarkable 11.9% Mean Absolute Error (MAE) drop of our SDANet.
arXiv Detail & Related papers (2020-06-17T13:34:42Z) - Predicting Livelihood Indicators from Community-Generated Street-Level
Imagery [70.5081240396352]
We propose an inexpensive, scalable, and interpretable approach to predict key livelihood indicators from public crowd-sourced street-level imagery.
By comparing our results against ground data collected in nationally-representative household surveys, we demonstrate the performance of our approach in accurately predicting indicators of poverty, population, and health.
arXiv Detail & Related papers (2020-06-15T18:12:12Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.