OpenUAS: Embeddings of Cities in Japan with Anchor Data for Cross-city Analysis of Area Usage Patterns
- URL: http://arxiv.org/abs/2407.19872v3
- Date: Tue, 12 Nov 2024 09:57:00 GMT
- Title: OpenUAS: Embeddings of Cities in Japan with Anchor Data for Cross-city Analysis of Area Usage Patterns
- Authors: Naoki Tamura, Kazuyuki Shoji, Shin Katayama, Kenta Urano, Takuro Yonezawa, Nobuo Kawaguchi,
- Abstract summary: This dataset is valuable for analyzing area functions in fields such as market analysis, urban planning, transportation infrastructure, and infection prediction.
It captures the characteristics of each area in the city, such as office districts and residential areas, by employing an area embedding technique.
We publicly release this anchor dataset along with area embedding datasets from several periods in eight major Japanese cities.
- Score: 0.9530591903982806
- License:
- Abstract: We publicly release OpenUAS, a dataset of area embeddings based on urban usage patterns, including embeddings for over 1.3 million 50-meter square meshes covering a total area of 3,300 square kilometers. This dataset is valuable for analyzing area functions in fields such as market analysis, urban planning, transportation infrastructure, and infection prediction. It captures the characteristics of each area in the city, such as office districts and residential areas, by employing an area embedding technique that utilizes location information typically obtained by GPS. Numerous area embedding techniques have been proposed, and while the public release of such embedding datasets is technically feasible, it has not been realized. One reason for this is that previous methods could not embed areas from different cities and periods into the same embedding space without sharing raw location data. We address this issue by developing an anchoring method that establishes anchors within a shared embedding space. We publicly release this anchor dataset along with area embedding datasets from several periods in eight major Japanese cities.
Related papers
- Area Modeling using Stay Information for Large-Scale Users and Analysis
for Influence of COVID-19 [2.6044668705725287]
Area usage is subject to change over time due to various events including seasonal shifts and pandemics.
There are many existing studies on area modeling, which characterize an area with some kind of information.
We propose a novel area modeling method named Area2Vec, inspired by Word2Vec, which models areas based on people's location data.
arXiv Detail & Related papers (2024-01-19T11:48:52Z) - LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting [65.71129509623587]
Road traffic forecasting plays a critical role in smart city initiatives and has experienced significant advancements thanks to the power of deep learning.
However, the promising results achieved on current public datasets may not be applicable to practical scenarios.
We introduce the LargeST benchmark dataset, which includes a total of 8,600 sensors in California with a 5-year time coverage.
arXiv Detail & Related papers (2023-06-14T05:48:36Z) - Dataset of Pathloss and ToA Radio Maps With Localization Application [59.11388233415274]
The datasets include simulated pathloss/received signal strength ( RSS) and time of arrival ( ToA) radio maps over a large collection of realistic dense urban setting in real city maps.
The two main applications of the presented dataset are 1) learning methods that predict the pathloss from input city maps, and, 2) wireless localization.
The fact that the RSS and ToA maps are computed by the same simulations over the same city maps allows for a fair comparison of the RSS and ToA-based localization methods.
arXiv Detail & Related papers (2022-11-18T20:39:51Z) - SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point
Clouds [52.624157840253204]
We introduce SensatUrban, an urban-scale UAV photogrammetry point cloud dataset consisting of nearly three billion points collected from three UK cities, covering 7.6 km2.
Each point in the dataset has been labelled with fine-grained semantic annotations, resulting in a dataset that is three times the size of the previous existing largest photogrammetric point cloud dataset.
arXiv Detail & Related papers (2022-01-12T14:48:11Z) - A Visual Analytics System for Profiling Urban Land Use Evolution [5.053505466956614]
Urban Chronicles is an open-source web-based visual analytics system that enables interactive exploration of changes in land use patterns.
We show the capabilities of the system by exploring the data over several years at different scales.
arXiv Detail & Related papers (2021-12-12T02:36:54Z) - Deep residential representations: Using unsupervised learning to unlock
elevation data for geo-demographic prediction [0.0]
LiDAR technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes.
To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains.
We consider the suitability of this data not just on its own but also as a source of data in combination with demographic features, thus providing a realistic use case for the embeddings.
arXiv Detail & Related papers (2021-12-02T17:10:52Z) - GANmapper: geographical content filling [0.0]
We present a new method to create spatial data using a generative adversarial network (GAN)
Our contribution uses coarse and widely available geospatial data to create maps of less available features at the finer scale in the built environment.
We employ land use data and road networks as input to generate building footprints, and conduct experiments in 9 cities around the world.
arXiv Detail & Related papers (2021-08-07T05:50:54Z) - Semantic Segmentation on Swiss3DCities: A Benchmark Study on Aerial
Photogrammetric 3D Pointcloud Dataset [67.44497676652173]
We introduce a new outdoor urban 3D pointcloud dataset, covering a total area of 2.7 $km2$, sampled from three Swiss cities.
The dataset is manually annotated for semantic segmentation with per-point labels, and is built using photogrammetry from images acquired by multirotors equipped with high-resolution cameras.
arXiv Detail & Related papers (2020-12-23T21:48:47Z) - Sketch and Scale: Geo-distributed tSNE and UMAP [75.44887265789056]
Running machine learning analytics over geographically distributed datasets is a rapidly arising problem.
We introduce a novel framework: Sketch and Scale (SnS)
It leverages a Count Sketch data structure to compress the data on the edge nodes, aggregates the reduced size sketches on the master node, and runs vanilla tSNE or UMAP on the summary.
We show this technique to be fully parallel, scale linearly in time, logarithmically in memory, and communication, making it possible to analyze datasets with many millions, potentially billions of data points, spread across several data centers around the globe.
arXiv Detail & Related papers (2020-11-11T22:32:21Z) - Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset,
Benchmarks and Challenges [52.624157840253204]
We present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points.
Our dataset consists of large areas from three UK cities, covering about 7.6 km2 of the city landscape.
We evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results.
arXiv Detail & Related papers (2020-09-07T14:47:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.