STDA-Meta: A Meta-Learning Framework for Few-Shot Traffic Prediction
- URL: http://arxiv.org/abs/2310.20223v1
- Date: Tue, 31 Oct 2023 06:52:56 GMT
- Title: STDA-Meta: A Meta-Learning Framework for Few-Shot Traffic Prediction
- Authors: Maoxiang Sun, Weilong Ding, Tianpu Zhang, Zijian Liu, Mengda Xing
- Abstract summary: We propose a novel-temporal domain adaptation (STDA) method that learns transferable meta-knowledge from data-sufficient cities in an adversarial manner.
This learned meta-knowledge can improve prediction performance of data-scarce cities.
Specifically, we train the STDA model using a Model-Atemporal Meta-Learning (MAML) based episode learning process.
- Score: 5.502177196766933
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As the development of cities, traffic congestion becomes an increasingly
pressing issue, and traffic prediction is a classic method to relieve that
issue. Traffic prediction is one specific application of spatio-temporal
prediction learning, like taxi scheduling, weather prediction, and ship
trajectory prediction. Against these problems, classical spatio-temporal
prediction learning methods including deep learning, require large amounts of
training data. In reality, some newly developed cities with insufficient
sensors would not hold that assumption, and the data scarcity makes predictive
performance worse. In such situation, the learning method on insufficient data
is known as few-shot learning (FSL), and the FSL of traffic prediction remains
challenges. On the one hand, graph structures' irregularity and dynamic nature
of graphs cannot hold the performance of spatio-temporal learning method. On
the other hand, conventional domain adaptation methods cannot work well on
insufficient training data, when transferring knowledge from different domains
to the intended target domain.To address these challenges, we propose a novel
spatio-temporal domain adaptation (STDA) method that learns transferable
spatio-temporal meta-knowledge from data-sufficient cities in an adversarial
manner. This learned meta-knowledge can improve the prediction performance of
data-scarce cities. Specifically, we train the STDA model using a
Model-Agnostic Meta-Learning (MAML) based episode learning process, which is a
model-agnostic meta-learning framework that enables the model to solve new
learning tasks using only a small number of training samples. We conduct
numerous experiments on four traffic prediction datasets, and our results show
that the prediction performance of our model has improved by 7\% compared to
baseline models on the two metrics of MAE and RMSE.
Related papers
- What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? [83.83230167222852]
We find that a model's generalization behavior can be effectively characterized by a training metric we call pre-memorization train accuracy.
By connecting a model's learning behavior to its generalization, pre-memorization train accuracy can guide targeted improvements to training strategies.
arXiv Detail & Related papers (2024-11-12T09:52:40Z) - Strada-LLM: Graph LLM for traffic prediction [62.2015839597764]
A considerable challenge in traffic prediction lies in handling the diverse data distributions caused by vastly different traffic conditions.
We propose a graph-aware LLM for traffic prediction that considers proximal traffic information.
We adopt a lightweight approach for efficient domain adaptation when facing new data distributions in few-shot fashion.
arXiv Detail & Related papers (2024-10-28T09:19:29Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models [27.306180426294784]
We introduce TPLLM, a novel traffic prediction framework leveraging Large Language Models (LLMs)
In this framework, we construct a sequence embedding layer based on Conal Neural Networks (LoCNNs) and a graph embedding layer based on Graph Contemporalal Networks (GCNs) to extract sequence features and spatial features.
Experiments on two real-world datasets demonstrate commendable performance in both full-sample and few-shot prediction scenarios.
arXiv Detail & Related papers (2024-03-04T17:08:57Z) - Pre-training on Synthetic Driving Data for Trajectory Prediction [61.520225216107306]
We propose a pipeline-level solution to mitigate the issue of data scarcity in trajectory forecasting.
We adopt HD map augmentation and trajectory synthesis for generating driving data, and then we learn representations by pre-training on them.
We conduct extensive experiments to demonstrate the effectiveness of our data expansion and pre-training strategies.
arXiv Detail & Related papers (2023-09-18T19:49:22Z) - EANet: Expert Attention Network for Online Trajectory Prediction [5.600280639034753]
Expert Attention Network is a complete online learning framework for trajectory prediction.
We introduce expert attention, which adjusts the weights of different depths of network layers, avoiding the model updated slowly due to gradient problem.
Furthermore, we propose a short-term motion trend kernel function which is sensitive to scenario change, allowing the model to respond quickly.
arXiv Detail & Related papers (2023-09-11T07:09:40Z) - NodeTrans: A Graph Transfer Learning Approach for Traffic Prediction [33.299309349152146]
We propose a novel transfer learning approach to solve the traffic prediction with few data.
First, a spatial-temporal graph neural network is proposed, which can capture the node-specific spatial-temporal traffic patterns of different road networks.
arXiv Detail & Related papers (2022-07-04T10:06:20Z) - Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge
Transfer [58.6106391721944]
Cross-city knowledge has shown its promise, where the model learned from data-sufficient cities is leveraged to benefit the learning process of data-scarce cities.
We propose a model-agnostic few-shot learning framework for S-temporal graph called ST-GFSL.
We conduct comprehensive experiments on four traffic speed prediction benchmarks and the results demonstrate the effectiveness of ST-GFSL compared with state-of-the-art methods.
arXiv Detail & Related papers (2022-05-27T12:46:52Z) - Imputation-Free Learning from Incomplete Observations [73.15386629370111]
We introduce the importance of guided gradient descent (IGSGD) method to train inference from inputs containing missing values without imputation.
We employ reinforcement learning (RL) to adjust the gradients used to train the models via back-propagation.
Our imputation-free predictions outperform the traditional two-step imputation-based predictions using state-of-the-art imputation methods.
arXiv Detail & Related papers (2021-07-05T12:44:39Z) - Transfer Learning and Online Learning for Traffic Forecasting under
Different Data Availability Conditions: Alternatives and Pitfalls [7.489793155793319]
This work aims at unveiling the potential of Transfer Learning (TL) for developing a traffic flow forecasting model in scenarios of absent data.
Traditional batch learning is compared against TL based models using real traffic flow data, collected by deployed loops managed by the City Council of Madrid (Spain)
In addition, we apply Online Learning (OL) techniques, where model receives an update after each prediction, in order to adapt to traffic flow trend changes and incrementally learn from new incoming traffic data.
arXiv Detail & Related papers (2020-05-08T10:53:49Z) - New Perspectives on the Use of Online Learning for Congestion Level
Prediction over Traffic Data [6.664111208927475]
This work focuses on classification over time series data.
When a time series is generated by non-stationary phenomena, the pattern relating the series with the class to be predicted may evolve over time.
Online learning methods incrementally learn from new data samples arriving over time, and accommodate eventual changes along the data stream.
arXiv Detail & Related papers (2020-03-27T09:44:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.