Deep residential representations: Using unsupervised learning to unlock
elevation data for geo-demographic prediction
- URL: http://arxiv.org/abs/2112.01421v1
- Date: Thu, 2 Dec 2021 17:10:52 GMT
- Title: Deep residential representations: Using unsupervised learning to unlock
elevation data for geo-demographic prediction
- Authors: Matthew Stevenson, Christophe Mues, Cristi\'an Bravo
- Abstract summary: LiDAR technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes.
To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains.
We consider the suitability of this data not just on its own but also as a source of data in combination with demographic features, thus providing a realistic use case for the embeddings.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: LiDAR (short for "Light Detection And Ranging" or "Laser Imaging, Detection,
And Ranging") technology can be used to provide detailed three-dimensional
elevation maps of urban and rural landscapes. To date, airborne LiDAR imaging
has been predominantly confined to the environmental and archaeological
domains. However, the geographically granular and open-source nature of this
data also lends itself to an array of societal, organizational and business
applications where geo-demographic type data is utilised. Arguably, the
complexity involved in processing this multi-dimensional data has thus far
restricted its broader adoption. In this paper, we propose a series of
convenient task-agnostic tile elevation embeddings to address this challenge,
using recent advances from unsupervised Deep Learning. We test the potential of
our embeddings by predicting seven English indices of deprivation (2019) for
small geographies in the Greater London area. These indices cover a range of
socio-economic outcomes and serve as a proxy for a wide variety of downstream
tasks to which the embeddings can be applied. We consider the suitability of
this data not just on its own but also as an auxiliary source of data in
combination with demographic features, thus providing a realistic use case for
the embeddings. Having trialled various model/embedding configurations, we find
that our best performing embeddings lead to Root-Mean-Squared-Error (RMSE)
improvements of up to 21% over using standard demographic features alone. We
also demonstrate how our embedding pipeline, using Deep Learning combined with
K-means clustering, produces coherent tile segments which allow the latent
embedding features to be interpreted.
Related papers
- Enabling Advanced Land Cover Analytics: An Integrated Data Extraction Pipeline for Predictive Modeling with the Dynamic World Dataset [1.3757956340051605]
We present a flexible and efficient end to end pipeline for working with the Dynamic World dataset.
This includes a pre-processing and representation framework which tackles noise removal, efficient extraction of large amounts of data, and re-representation of LULC data.
To demonstrate the power of our pipeline, we use it to extract data for an urbanization prediction problem and build a suite of machine learning models with excellent performance.
arXiv Detail & Related papers (2024-10-11T16:13:01Z) - Into the Unknown: Generating Geospatial Descriptions for New Environments [18.736071151303726]
Rendezvous task requires reasoning over allocentric spatial relationships.
Using opensource descriptions paired with coordinates (e.g., Wikipedia) provides training data but suffers from limited spatially-oriented text.
We propose a large-scale augmentation method for generating high-quality synthetic data for new environments.
arXiv Detail & Related papers (2024-06-28T14:56:21Z) - GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models.
We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods.
Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z) - LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting [65.71129509623587]
Road traffic forecasting plays a critical role in smart city initiatives and has experienced significant advancements thanks to the power of deep learning.
However, the promising results achieved on current public datasets may not be applicable to practical scenarios.
We introduce the LargeST benchmark dataset, which includes a total of 8,600 sensors in California with a 5-year time coverage.
arXiv Detail & Related papers (2023-06-14T05:48:36Z) - A General Purpose Neural Architecture for Geospatial Systems [142.43454584836812]
We present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias.
We envision how such a model may facilitate cooperation between members of the community.
arXiv Detail & Related papers (2022-11-04T09:58:57Z) - DOORS: Dataset fOr bOuldeRs Segmentation. Statistical properties and
Blender setup [0.0]
The capability to detect boulders on the surface of small bodies is beneficial for vision-based applications such as hazard detection during critical operations and navigation.
This work provides a statistical characterization and setup used for the generation of two datasets about boulders on small bodies that are made publicly available.
arXiv Detail & Related papers (2022-10-28T16:39:06Z) - Semantic Segmentation of Vegetation in Remote Sensing Imagery Using Deep
Learning [77.34726150561087]
We propose an approach for creating a multi-modal and large-temporal dataset comprised of publicly available Remote Sensing data.
We use Convolutional Neural Networks (CNN) models that are capable of separating different classes of vegetation.
arXiv Detail & Related papers (2022-09-28T18:51:59Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - GANmapper: geographical content filling [0.0]
We present a new method to create spatial data using a generative adversarial network (GAN)
Our contribution uses coarse and widely available geospatial data to create maps of less available features at the finer scale in the built environment.
We employ land use data and road networks as input to generate building footprints, and conduct experiments in 9 cities around the world.
arXiv Detail & Related papers (2021-08-07T05:50:54Z) - Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset,
Benchmarks and Challenges [52.624157840253204]
We present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points.
Our dataset consists of large areas from three UK cities, covering about 7.6 km2 of the city landscape.
We evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results.
arXiv Detail & Related papers (2020-09-07T14:47:07Z) - Learning and Recognizing Archeological Features from LiDAR Data [8.135393502095909]
We present a remote sensing pipeline that processes LiDAR (Light Detection And Ranging) data through machine & deep learning.
We aim at identifying geo-spatial areas with archeological artifacts in a supervised fashion allowing the domain expert to flexibly tune parameters based on her needs.
arXiv Detail & Related papers (2020-04-05T05:36:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.