Multimodal Urban Areas of Interest Generation via Remote Sensing Imagery
and Geographical Prior
- URL: http://arxiv.org/abs/2401.06550v3
- Date: Thu, 8 Feb 2024 06:23:42 GMT
- Title: Multimodal Urban Areas of Interest Generation via Remote Sensing Imagery
and Geographical Prior
- Authors: Chuanji Shi, Yingying Zhang, Jiaotuan Wang, Xin Guo and Qiqi Zhu
- Abstract summary: Urban area-of-interest (AOI) refers to an integrated urban functional zone with defined polygonal boundaries.
We propose a comprehensive end-to-end multimodal deep learning framework designed for simultaneously detecting accurate AOI boundaries and validating the reliability of AOI.
- Score: 9.85003064364004
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Urban area-of-interest (AOI) refers to an integrated urban functional zone
with defined polygonal boundaries. The rapid development of urban commerce has
led to increasing demands for highly accurate and timely AOI data. However,
existing research primarily focuses on coarse-grained functional zones for
urban planning or regional economic analysis, and often neglects the expiration
of AOI in the real world. They fail to fulfill the precision demands of Mobile
Internet Online-to-Offline (O2O) businesses. These businesses require accuracy
down to a specific community, school, or hospital. In this paper, we propose a
comprehensive end-to-end multimodal deep learning framework designed for
simultaneously detecting accurate AOI boundaries and validating the reliability
of AOI by leveraging remote sensing imagery coupled with geographical prior,
titled AOITR. Unlike conventional AOI generation methods, such as the Road-cut
method that segments road networks at various levels, our approach diverges
from semantic segmentation algorithms that depend on pixel-level
classification. Instead, our AOITR begins by selecting a point-of-interest
(POI) of specific category, and uses it to retrieve corresponding remote
sensing imagery and geographical prior such as entrance POIs and road nodes.
This information helps to build a multimodal detection model based on
transformer encoder-decoder architecture to regress the AOI polygon.
Additionally, we utilize the dynamic features from human mobility, nearby POIs,
and logistics addresses for AOI reliability evaluation via a cascaded network
module. The experimental results reveal that our algorithm achieves a
significant improvement on Intersection over Union (IoU) metric, surpassing
previous methods by a large margin.
Related papers
- GOMAA-Geo: GOal Modality Agnostic Active Geo-localization [49.599465495973654]
We consider the task of active geo-localization (AGL) in which an agent uses a sequence of visual cues observed during aerial navigation to find a target specified through multiple possible modalities.
GOMAA-Geo is a goal modality active geo-localization agent for zero-shot generalization between different goal modalities.
arXiv Detail & Related papers (2024-06-04T02:59:36Z) - Translating Images to Road Network: A Sequence-to-Sequence Perspective [32.39335559663393]
Road network is essential for the generation of high-definition maps.
Existing methods struggle to merge the two types of data domains effectively.
We propose a unified representation of both types of data domain by projecting both Euclidean and non-Euclidean data into an integer series called RoadNet Sequence.
arXiv Detail & Related papers (2024-02-13T04:12:41Z) - Leveraging Topology for Domain Adaptive Road Segmentation in Satellite
and Aerial Imagery [9.23555285827483]
Road segmentation algorithms fail to generalize to new geographical locations.
Road skeleton is an auxiliary task to impose the topological constraints.
For self-training, we filter out the noisy pseudo-labels by using a connectivity-based pseudo-labels refinement strategy.
arXiv Detail & Related papers (2023-09-27T12:50:51Z) - Cross-City Matters: A Multimodal Remote Sensing Benchmark Dataset for
Cross-City Semantic Segmentation using High-Resolution Domain Adaptation
Networks [82.82866901799565]
We build a new set of multimodal remote sensing benchmark datasets (including hyperspectral, multispectral, SAR) for the study purpose of the cross-city semantic segmentation task.
Beyond the single city, we propose a high-resolution domain adaptation network, HighDAN, to promote the AI model's generalization ability from the multi-city environments.
HighDAN is capable of retaining the spatially topological structure of the studied urban scene well in a parallel high-to-low resolution fusion fashion.
arXiv Detail & Related papers (2023-09-26T23:55:39Z) - Adaptive Hierarchical SpatioTemporal Network for Traffic Forecasting [70.66710698485745]
We propose an Adaptive Hierarchical SpatioTemporal Network (AHSTN) to promote traffic forecasting.
AHSTN exploits the spatial hierarchy and modeling multi-scale spatial correlations.
Experiments on two real-world datasets show that AHSTN achieves better performance over several strong baselines.
arXiv Detail & Related papers (2023-06-15T14:50:27Z) - Semi-supervised Road Updating Network (SRUNet): A Deep Learning Method
for Road Updating from Remote Sensing Imagery and Historical Vector Maps [3.350048575501172]
We propose a road detection method based on semi-supervised learning (SRUNet) specifically for road-updating applications.
The proposed SRUNet can provide stable, up-to-date, and reliable prediction results for a wide range of road renewal tasks.
arXiv Detail & Related papers (2023-04-28T16:51:35Z) - UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark
Suite [21.565438268381467]
We introduce the road pothole detection task, the first online competition published within this benchmark suite.
Our benchmark provides a systematic and thorough evaluation of state-of-the-art object detection, semantic segmentation, and instance segmentation networks.
By providing algorithms with a more comprehensive understanding of diverse road conditions, we seek to unlock their untapped potential.
arXiv Detail & Related papers (2023-04-18T09:13:52Z) - A General Purpose Neural Architecture for Geospatial Systems [142.43454584836812]
We present a roadmap towards the construction of a general-purpose neural architecture (GPNA) with a geospatial inductive bias.
We envision how such a model may facilitate cooperation between members of the community.
arXiv Detail & Related papers (2022-11-04T09:58:57Z) - Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust
Road Extraction [110.61383502442598]
We introduce a novel neural network framework termed Cross-Modal Message Propagation Network (CMMPNet)
CMMPNet is composed of two deep Auto-Encoders for modality-specific representation learning and a tailor-designed Dual Enhancement Module for cross-modal representation refinement.
Experiments on three real-world benchmarks demonstrate the effectiveness of our CMMPNet for robust road extraction.
arXiv Detail & Related papers (2021-11-30T04:30:10Z) - Traffic Flow Forecasting with Spatial-Temporal Graph Diffusion Network [39.65520262751766]
We develop a new traffic prediction framework-Spatial-Temporal Graph Diffusion Network (ST-GDN)
In particular, ST-GDN is a hierarchically structured graph neural architecture which learns not only the local region-wise geographical dependencies, but also the spatial semantics from a global perspective.
Experiments on several real-life traffic datasets demonstrate that ST-GDN outperforms different types of state-of-the-art baselines.
arXiv Detail & Related papers (2021-10-08T11:19:06Z) - Stagewise Unsupervised Domain Adaptation with Adversarial Self-Training
for Road Segmentation of Remote Sensing Images [93.50240389540252]
Road segmentation from remote sensing images is a challenging task with wide ranges of application potentials.
We propose a novel stagewise domain adaptation model called RoadDA to address the domain shift (DS) issue in this field.
Experiment results on two benchmarks demonstrate that RoadDA can efficiently reduce the domain gap and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2021-08-28T09:29:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.