Knowledge-infused Contrastive Learning for Urban Imagery-based
Socioeconomic Prediction
- URL: http://arxiv.org/abs/2302.13094v1
- Date: Sat, 25 Feb 2023 14:53:17 GMT
- Title: Knowledge-infused Contrastive Learning for Urban Imagery-based
Socioeconomic Prediction
- Authors: Yu Liu, Xin Zhang, Jingtao Ding, Yanxin Xi, Yong Li
- Abstract summary: Urban imagery in web like satellite/street view images has emerged as an important source for socioeconomic prediction.
We propose a Knowledge-infused Contrastive Learning model for urban imagery-based socioeconomic prediction.
Our proposed KnowCL model can apply to both satellite and street imagery with both effectiveness and transferability achieved.
- Score: 13.26632316765164
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Monitoring sustainable development goals requires accurate and timely
socioeconomic statistics, while ubiquitous and frequently-updated urban imagery
in web like satellite/street view images has emerged as an important source for
socioeconomic prediction. Especially, recent studies turn to self-supervised
contrastive learning with manually designed similarity metrics for urban
imagery representation learning and further socioeconomic prediction, which
however suffers from effectiveness and robustness issues. To address such
issues, in this paper, we propose a Knowledge-infused Contrastive Learning
(KnowCL) model for urban imagery-based socioeconomic prediction. Specifically,
we firstly introduce knowledge graph (KG) to effectively model the urban
knowledge in spatiality, mobility, etc., and then build neural network based
encoders to learn representations of an urban image in associated semantic and
visual spaces, respectively. Finally, we design a cross-modality based
contrastive learning framework with a novel image-KG contrastive loss, which
maximizes the mutual information between semantic and visual representations
for knowledge infusion. Extensive experiments of applying the learnt visual
representations for socioeconomic prediction on three datasets demonstrate the
superior performance of KnowCL with over 30\% improvements on $R^2$ compared
with baselines. Especially, our proposed KnowCL model can apply to both
satellite and street imagery with both effectiveness and transferability
achieved, which provides insights into urban imagery-based socioeconomic
prediction.
Related papers
- MuseCL: Predicting Urban Socioeconomic Indicators via Multi-Semantic Contrastive Learning [13.681538916025021]
MuseCL is a framework for fine-grained urban region profiling and socioeconomic prediction.
We construct contrastive sample pairs for street view and remote sensing images, capitalizing on similarities in human mobility.
We extract semantic insights from POI texts embedded within these regions, employing a pre-trained text encoder.
arXiv Detail & Related papers (2024-06-23T09:49:41Z) - Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes [15.842647992041751]
Post-hoc concept-based explanations can be an important step towards broader adoption of these models in policy-making.
We study the interplay between representation learning using an additional task-specific contrastive loss and post-hoc concept explainability for socioeconomic studies.
arXiv Detail & Related papers (2024-04-15T13:13:56Z) - Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization [62.157627519792946]
We introduce a novel framework called bridged transfer, which initially employs synthetic images for fine-tuning a pre-trained model to improve its transferability.
We propose dataset style inversion strategy to improve the stylistic alignment between synthetic and real images.
Our proposed methods are evaluated across 10 different datasets and 5 distinct models, demonstrating consistent improvements.
arXiv Detail & Related papers (2024-03-28T22:25:05Z) - UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web [37.332601383723585]
This paper introduces the first-ever framework that integrates the knowledge of textual modality into urban imagery profiling.
It generates a detailed textual description for each satellite image by an open-source Image-to-Text LLM.
The model is trained on the image-text pairs, seamlessly unifying natural language supervision for urban visual representation learning.
arXiv Detail & Related papers (2023-10-22T02:32:53Z) - A Novel Neural-symbolic System under Statistical Relational Learning [50.747658038910565]
We propose a general bi-level probabilistic graphical reasoning framework called GBPGR.
In GBPGR, the results of symbolic reasoning are utilized to refine and correct the predictions made by the deep learning models.
Our approach achieves high performance and exhibits effective generalization in both transductive and inductive tasks.
arXiv Detail & Related papers (2023-09-16T09:15:37Z) - Mitigating Urban-Rural Disparities in Contrastive Representation Learning with Satellite Imagery [19.93324644519412]
We consider the risk of urban-rural disparities in identification of land-cover features.
We propose fair dense representation with contrastive learning (FairDCL) as a method for de-biasing the multi-level latent space of convolution neural network models.
The obtained image representation mitigates downstream urban-rural prediction disparities and outperforms state-of-the-art baselines on real-world satellite images.
arXiv Detail & Related papers (2022-11-16T04:59:46Z) - Exploring CLIP for Assessing the Look and Feel of Images [87.97623543523858]
We introduce Contrastive Language-Image Pre-training (CLIP) models for assessing both the quality perception (look) and abstract perception (feel) of images in a zero-shot manner.
Our results show that CLIP captures meaningful priors that generalize well to different perceptual assessments.
arXiv Detail & Related papers (2022-07-25T17:58:16Z) - Causal Reasoning Meets Visual Representation Learning: A Prospective
Study [117.08431221482638]
Lack of interpretability, robustness, and out-of-distribution generalization are becoming the challenges of the existing visual models.
Inspired by the strong inference ability of human-level agents, recent years have witnessed great effort in developing causal reasoning paradigms.
This paper aims to provide a comprehensive overview of this emerging field, attract attention, encourage discussions, bring to the forefront the urgency of developing novel causal reasoning methods.
arXiv Detail & Related papers (2022-04-26T02:22:28Z) - Enhancing Social Relation Inference with Concise Interaction Graph and
Discriminative Scene Representation [56.25878966006678]
We propose an approach of textbfPRactical textbfInference in textbfSocial rtextbfElation (PRISE)
It concisely learns interactive features of persons and discriminative features of holistic scenes.
PRISE achieves 6.8$%$ improvement for domain classification in PIPA dataset.
arXiv Detail & Related papers (2021-07-30T04:20:13Z) - Predicting Livelihood Indicators from Community-Generated Street-Level
Imagery [70.5081240396352]
We propose an inexpensive, scalable, and interpretable approach to predict key livelihood indicators from public crowd-sourced street-level imagery.
By comparing our results against ground data collected in nationally-representative household surveys, we demonstrate the performance of our approach in accurately predicting indicators of poverty, population, and health.
arXiv Detail & Related papers (2020-06-15T18:12:12Z) - Socioeconomic correlations of urban patterns inferred from aerial
images: interpreting activation maps of Convolutional Neural Networks [0.10152838128195464]
Urbanisation is a great challenge for modern societies, promising better access to economic opportunities while widening socioeconomic inequalities.
Here we close this gap by predicting socioeconomic status across France from aerial images and interpreting class activation mappings in terms of urban topology.
These results pave the way to build interpretable models, which may help to better track and understand urbanisation and its consequences.
arXiv Detail & Related papers (2020-04-10T04:57:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.