Learning Economic Indicators by Aggregating Multi-Level Geospatial
Information
- URL: http://arxiv.org/abs/2205.01472v1
- Date: Tue, 3 May 2022 13:05:39 GMT
- Title: Learning Economic Indicators by Aggregating Multi-Level Geospatial
Information
- Authors: Sungwon Park, Sungwon Han, Donghyun Ahn, Jaeyeon Kim, Jeasurk Yang,
Susang Lee, Seunghoon Hong, Jihee Kim, Sangyoon Park, Hyunjoo Yang, Meeyoung
Cha
- Abstract summary: This research presents a deep learning model to predict economic indicators via aggregating traits observed from multiple levels of geographical units.
Our new multi-level learning model substantially outperforms strong baselines in predicting key indicators such as population, purchasing power, and energy consumption.
We discuss the multi-level model's implications for measuring inequality, which is the essential first step in policy and social science research on inequality and poverty.
- Score: 20.0397537179667
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: High-resolution daytime satellite imagery has become a promising source to
study economic activities. These images display detailed terrain over large
areas and allow zooming into smaller neighborhoods. Existing methods, however,
have utilized images only in a single-level geographical unit. This research
presents a deep learning model to predict economic indicators via aggregating
traits observed from multiple levels of geographical units. The model first
measures hyperlocal economy over small communities via ordinal regression. The
next step extracts district-level features by summarizing interconnection among
hyperlocal economies. In the final step, the model estimates economic
indicators of districts via aggregating the hyperlocal and district
information. Our new multi-level learning model substantially outperforms
strong baselines in predicting key indicators such as population, purchasing
power, and energy consumption. The model is also robust against data shortage;
the trained features from one country can generalize to other countries when
evaluated with data gathered from Malaysia, the Philippines, Thailand, and
Vietnam. We discuss the multi-level model's implications for measuring
inequality, which is the essential first step in policy and social science
research on inequality and poverty.
Related papers
- GeoSEE: Regional Socio-Economic Estimation With a Large Language Model [17.31652821477571]
We present GeoSEE, a method that can estimate various socio-economic indicators using a unified pipeline powered by a large language model (LLM)
The system then computes target indicators via in-context learning after aggregating results from selected modules in the format of natural language-based texts.
Our method outperforms other predictive models in both unsupervised and low-shot contexts.
arXiv Detail & Related papers (2024-06-14T07:50:22Z) - Granularity at Scale: Estimating Neighborhood Socioeconomic Indicators
from High-Resolution Orthographic Imagery and Hybrid Learning [1.8369448205408005]
Overhead images can help fill in the gaps where community information is sparse.
Recent advancements in machine learning and computer vision have made it possible to quickly extract features from and detect patterns in image data.
In this work, we explore how well two approaches, a supervised convolutional neural network and semi-supervised clustering can estimate population density, median household income, and educational attainment.
arXiv Detail & Related papers (2023-09-28T19:30:26Z) - Bias and Fairness in Large Language Models: A Survey [73.87651986156006]
We present a comprehensive survey of bias evaluation and mitigation techniques for large language models (LLMs)
We first consolidate, formalize, and expand notions of social bias and fairness in natural language processing.
We then unify the literature by proposing three intuitive, two for bias evaluation, and one for mitigation.
arXiv Detail & Related papers (2023-09-02T00:32:55Z) - Fine-Grained Socioeconomic Prediction from Satellite Images with
Distributional Adjustment [14.076490368696508]
We propose a method that assigns a socioeconomic score to each satellite image by capturing the distributional behavior observed in larger areas.
We train an ordinal regression scoring model and adjust the scores to follow the common power law within and across regions.
Our method also demonstrates robust performance in districts with uneven development, suggesting its potential use in developing countries.
arXiv Detail & Related papers (2023-08-30T12:06:04Z) - Interpreting wealth distribution via poverty map inference using
multimodal data [0.0]
We propose a pipeline of machine learning models to infer the mean and standard deviation of wealth across multiple populated places.
These models leverage seven independent and freely available feature sources based on satellite images, and metadata collected via online crowd-sourcing and social media.
Our results recover the local mean and variation of wealth, and correctly capture the positive yet non-monotonous correlation between them.
arXiv Detail & Related papers (2023-02-17T11:35:44Z) - Taxonomizing local versus global structure in neural network loss
landscapes [60.206524503782006]
We show that the best test accuracy is obtained when the loss landscape is globally well-connected.
We also show that globally poorly-connected landscapes can arise when models are small or when they are trained to lower quality data.
arXiv Detail & Related papers (2021-07-23T13:37:14Z) - Methodological Foundation of a Numerical Taxonomy of Urban Form [62.997667081978825]
We present a method for numerical taxonomy of urban form derived from biological systematics.
We derive homogeneous urban tissue types and, by determining overall morphological similarity between them, generate a hierarchical classification of urban form.
After framing and presenting the method, we test it on two cities - Prague and Amsterdam.
arXiv Detail & Related papers (2021-04-30T12:47:52Z) - Dataset Cartography: Mapping and Diagnosing Datasets with Training
Dynamics [118.75207687144817]
We introduce Data Maps, a model-based tool to characterize and diagnose datasets.
We leverage a largely ignored source of information: the behavior of the model on individual instances during training.
Our results indicate that a shift in focus from quantity to quality of data could lead to robust models and improved out-of-distribution generalization.
arXiv Detail & Related papers (2020-09-22T20:19:41Z) - Magnify Your Population: Statistical Downscaling to Augment the Spatial
Resolution of Socioeconomic Census Data [48.7576911714538]
We present a new statistical downscaling approach to derive fine-scale estimates of key socioeconomic attributes.
For each selected socioeconomic variable, a Random Forest model is trained on the source Census units and then used to generate fine-scale gridded predictions.
As a case study, we apply this method to Census data in the United States, downscaling the selected socioeconomic variables available at the block group level, to a grid of 300 spatial resolution.
arXiv Detail & Related papers (2020-06-23T16:52:18Z) - Predicting Livelihood Indicators from Community-Generated Street-Level
Imagery [70.5081240396352]
We propose an inexpensive, scalable, and interpretable approach to predict key livelihood indicators from public crowd-sourced street-level imagery.
By comparing our results against ground data collected in nationally-representative household surveys, we demonstrate the performance of our approach in accurately predicting indicators of poverty, population, and health.
arXiv Detail & Related papers (2020-06-15T18:12:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.