Semantic-Enhanced Representation Learning for Road Networks with Temporal Dynamics
- URL: http://arxiv.org/abs/2403.11495v1
- Date: Mon, 18 Mar 2024 05:59:56 GMT
- Title: Semantic-Enhanced Representation Learning for Road Networks with Temporal Dynamics
- Authors: Yile Chen, Xiucheng Li, Gao Cong, Zhifeng Bao, Cheng Long,
- Abstract summary: We introduce a novel framework called Toast for learning general-purpose representations of road networks, along with its advanced counterpart DyToast.
Specifically, we propose to encode two pivotal semantic characteristics intrinsic to road networks: traffic patterns and traveling semantics.
Our proposed framework consistently outperforms the state-of-the-art baselines by a significant margin.
- Score: 33.940044533340235
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this study, we introduce a novel framework called Toast for learning general-purpose representations of road networks, along with its advanced counterpart DyToast, designed to enhance the integration of temporal dynamics to boost the performance of various time-sensitive downstream tasks. Specifically, we propose to encode two pivotal semantic characteristics intrinsic to road networks: traffic patterns and traveling semantics. To achieve this, we refine the skip-gram module by incorporating auxiliary objectives aimed at predicting the traffic context associated with a target road segment. Moreover, we leverage trajectory data and design pre-training strategies based on Transformer to distill traveling semantics on road networks. DyToast further augments this framework by employing unified trigonometric functions characterized by their beneficial properties, enabling the capture of temporal evolution and dynamic nature of road networks more effectively. With these proposed techniques, we can obtain representations that encode multi-faceted aspects of knowledge within road networks, applicable across both road segment-based applications and trajectory-based applications. Extensive experiments on two real-world datasets across three tasks demonstrate that our proposed framework consistently outperforms the state-of-the-art baselines by a significant margin.
Related papers
- Trajectory Representation Learning on Road Networks and Grids with Spatio-Temporal Dynamics [0.8655526882770742]
Trajectory representation learning is a fundamental task for applications in fields including smart city, and urban planning.
We propose TIGR, a novel model designed to integrate grid and road network modalities while incorporatingtemporal dynamics.
We evaluate TIGR on two realworld datasets and demonstrate the effectiveness of combining both modalities.
arXiv Detail & Related papers (2024-11-21T10:56:02Z) - Cross-Domain Transfer Learning using Attention Latent Features for Multi-Agent Trajectory Prediction [4.292918274985369]
We propose a novel spatial-temporal trajectory prediction framework that performs cross-domain adaption on the attention representation of a Transformer-based model.
A graph convolutional network is also integrated to construct dynamic graph feature embeddings that accurately model the complex spatial-temporal interactions between the multi-agent vehicles.
arXiv Detail & Related papers (2024-11-09T06:39:44Z) - Image Captioning via Dynamic Path Customization [100.15412641586525]
We propose a novel Dynamic Transformer Network (DTNet) for image captioning, which dynamically assigns customized paths to different samples, leading to discriminative yet accurate captions.
To validate the effectiveness of our proposed DTNet, we conduct extensive experiments on the MS-COCO dataset and achieve new state-of-the-art performance.
arXiv Detail & Related papers (2024-06-01T07:23:21Z) - Fine-Grained Extraction of Road Networks via Joint Learning of
Connectivity and Segmentation [5.496893845821393]
Road network extraction from satellite images is widely applicated in intelligent traffic management and autonomous driving fields.
The high-resolution remote sensing images contain complex road areas and distracted background, which make it a challenge for road extraction.
We present a stacked multitask network for end-to-end segmenting roads while preserving connectivity correctness.
arXiv Detail & Related papers (2023-12-07T22:57:17Z) - A Deeply Supervised Semantic Segmentation Method Based on GAN [9.441379867578332]
The proposed model integrates a generative adversarial network (GAN) framework into the traditional semantic segmentation model.
The effectiveness of our approach is demonstrated by a significant boost in performance on the road crack dataset.
arXiv Detail & Related papers (2023-10-06T08:22:24Z) - Dynamic Spatial Sparsification for Efficient Vision Transformers and
Convolutional Neural Networks [88.77951448313486]
We present a new approach for model acceleration by exploiting spatial sparsity in visual data.
We propose a dynamic token sparsification framework to prune redundant tokens.
We extend our method to hierarchical models including CNNs and hierarchical vision Transformers.
arXiv Detail & Related papers (2022-07-04T17:00:51Z) - Learning to integrate vision data into road network data [14.86655504533083]
Road networks are the core infrastructure for connected and autonomous vehicles.
We propose to integrate remote sensing vision data into network data for improved embeddings with graph neural networks.
We achieve state-of-the-art performance on the OSM+Di Chuxing dataset on Chengdu, China.
arXiv Detail & Related papers (2021-12-20T15:38:49Z) - Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust
Road Extraction [110.61383502442598]
We introduce a novel neural network framework termed Cross-Modal Message Propagation Network (CMMPNet)
CMMPNet is composed of two deep Auto-Encoders for modality-specific representation learning and a tailor-designed Dual Enhancement Module for cross-modal representation refinement.
Experiments on three real-world benchmarks demonstrate the effectiveness of our CMMPNet for robust road extraction.
arXiv Detail & Related papers (2021-11-30T04:30:10Z) - Road Network Guided Fine-Grained Urban Traffic Flow Inference [108.64631590347352]
Accurate inference of fine-grained traffic flow from coarse-grained one is an emerging yet crucial problem.
We propose a novel Road-Aware Traffic Flow Magnifier (RATFM) that exploits the prior knowledge of road networks.
Our method can generate high-quality fine-grained traffic flow maps.
arXiv Detail & Related papers (2021-09-29T07:51:49Z) - Spatio-temporal Modeling for Large-scale Vehicular Networks Using Graph
Convolutional Networks [110.80088437391379]
A graph-based framework called SMART is proposed to model and keep track of the statistics of vehicle-to-temporal (V2I) communication latency across a large geographical area.
We develop a graph reconstruction-based approach using a graph convolutional network integrated with a deep Q-networks algorithm.
Our results show that the proposed method can significantly improve both the accuracy and efficiency for modeling and the latency performance of large vehicular networks.
arXiv Detail & Related papers (2021-03-13T06:56:29Z) - VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized
Representation [74.56282712099274]
This paper introduces VectorNet, a hierarchical graph neural network that exploits the spatial locality of individual road components represented by vectors.
By operating on the vectorized high definition (HD) maps and agent trajectories, we avoid lossy rendering and computationally intensive ConvNet encoding steps.
We evaluate VectorNet on our in-house behavior prediction benchmark and the recently released Argoverse forecasting dataset.
arXiv Detail & Related papers (2020-05-08T19:07:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.