DelayPTC-LLM: Metro Passenger Travel Choice Prediction under Train Delays with Large Language Models
- URL: http://arxiv.org/abs/2410.00052v1
- Date: Sat, 28 Sep 2024 13:09:15 GMT
- Title: DelayPTC-LLM: Metro Passenger Travel Choice Prediction under Train Delays with Large Language Models
- Authors: Chen Chen, Yuxin He, Hao Wang, Jingjing Chen, Qin Luo,
- Abstract summary: This paper proposes a novel Passenger Travel Choice prediction framework under metro delays with the Large Language Model (DelayPTC-LLM)
A comparative analysis of DelayPTC-LLM with traditional prediction models demonstrates the superior capability of LLMs in handling complex, sparse datasets commonly encountered under disruption of transportation systems.
- Score: 31.509436717815102
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Train delays can propagate rapidly throughout the Urban Rail Transit (URT) network under networked operation conditions, posing significant challenges to operational departments. Accurately predicting passenger travel choices under train delays can provide interpretable insights into the redistribution of passenger flow, offering crucial decision support for emergency response and service recovery. However, the diversity of travel choices due to passenger heterogeneity and the sparsity of delay events leads to issues of data sparsity and sample imbalance in the travel choices dataset under metro delays. It is challenging to model this problem using traditional machine learning approaches, which typically rely on large, balanced datasets. Given the strengths of large language models (LLMs) in text processing, understanding, and their capabilities in small-sample and even zero-shot learning, this paper proposes a novel Passenger Travel Choice prediction framework under metro delays with the Large Language Model (DelayPTC-LLM). The well-designed prompting engineering is developed to guide the LLM in making and rationalizing predictions about travel choices, taking into account passenger heterogeneity and features of the delay events. Utilizing real-world data from Shenzhen Metro, including Automated Fare Collection (AFC) data and detailed delay logs, a comparative analysis of DelayPTC-LLM with traditional prediction models demonstrates the superior capability of LLMs in handling complex, sparse datasets commonly encountered under disruption of transportation systems. The results validate the advantages of DelayPTC-LLM in terms of predictive accuracy and its potential to provide actionable insights for big traffic data.
Related papers
- Improving Traffic Flow Predictions with SGCN-LSTM: A Hybrid Model for Spatial and Temporal Dependencies [55.2480439325792]
This paper introduces the Signal-Enhanced Graph Convolutional Network Long Short Term Memory (SGCN-LSTM) model for predicting traffic speeds across road networks.
Experiments on the PEMS-BAY road network traffic dataset demonstrate the SGCN-LSTM model's effectiveness.
arXiv Detail & Related papers (2024-11-01T00:37:00Z) - Strada-LLM: Graph LLM for traffic prediction [62.2015839597764]
A considerable challenge in traffic prediction lies in handling the diverse data distributions caused by vastly different traffic conditions.
We propose a graph-aware LLM for traffic prediction that considers proximal traffic information.
We adopt a lightweight approach for efficient domain adaptation when facing new data distributions in few-shot fashion.
arXiv Detail & Related papers (2024-10-28T09:19:29Z) - A Prompt Refinement-based Large Language Model for Metro Passenger Flow Forecasting under Delay Conditions [30.552007081903263]
Short-term forecasts of passenger flow in metro systems under delay conditions are crucial for emergency response and service recovery.
Due to the rare occurrence of delay events, the limited sample size under delay conditions make it difficult for conventional models to capture the complex impacts of delays on passenger flow.
We propose a passenger flow forecasting framework that synthesizes an LLM with carefully designed prompt engineering.
arXiv Detail & Related papers (2024-10-19T13:46:46Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - R-SFLLM: Jamming Resilient Framework for Split Federated Learning with Large Language Models [83.77114091471822]
Split federated learning (SFL) is a compute-efficient paradigm in distributed machine learning (ML)
A challenge in SFL, particularly when deployed over wireless channels, is the susceptibility of transmitted model parameters to adversarial jamming.
This is particularly pronounced for word embedding parameters in large language models (LLMs), which are crucial for language understanding.
A physical layer framework is developed for resilient SFL with LLMs (R-SFLLM) over wireless networks.
arXiv Detail & Related papers (2024-07-16T12:21:29Z) - Streaming detection of significant delay changes in public transport systems [0.9217021281095907]
Disruptions in public transport such as delays may negatively affect mobility choices.
We propose both the method detecting significant delays and reference architecture.
Method can complement the calculation of delays defined as deviation from schedules.
arXiv Detail & Related papers (2024-04-11T15:54:20Z) - Meta-learning Based Short-Term Passenger Flow Prediction for
Newly-Operated Urban Rail Transit Stations [3.718942345103135]
We propose a meta-learning method named Meta Long Short-Term Memory Network (Meta-LSTM) to predict the passenger flow in newly-operated stations.
The Meta-LSTM is applied to the subway network of Nanning, Hangzhou, and Beijing, China.
arXiv Detail & Related papers (2022-10-13T15:27:28Z) - Interpretable Machine Learning Models for Modal Split Prediction in
Transportation Systems [0.43012765978447565]
Modal split prediction in transportation networks has the potential to support network operators in managing traffic congestion and improving transit service reliability.
We focus on the problem of hourly prediction of the fraction of travelers choosing one mode of transportation over another using high-dimensional travel time data.
We employ various regularization techniques for variable selection to prevent overfitting and resolve multicollinearity issues.
arXiv Detail & Related papers (2022-03-27T02:59:00Z) - Multi-Airport Delay Prediction with Transformers [0.0]
Temporal Fusion Transformer (TFT) was proposed to predict departure and arrival delays simultaneously for multiple airports.
This approach can capture complex temporal dynamics of the inputs known at the time of prediction and then forecast selected delay metrics up to four hours into the future.
arXiv Detail & Related papers (2021-11-04T21:58:11Z) - Federated Learning with Correlated Data: Taming the Tail for Age-Optimal
Industrial IoT [55.62157530259969]
We study a sensor's transmit power minimization subject to the peak-AoI requirement and a probabilistic constraint on queuing latency.
We propose a local-model selection approach which accounts for correlation among the sensor's training data.
Numerical results show the tradeoff between the transmit power, peak AoI, and delay's tail distribution.
arXiv Detail & Related papers (2021-08-17T08:38:31Z) - FMA-ETA: Estimating Travel Time Entirely Based on FFN With Attention [88.33372574562824]
We propose a novel framework based on feed-forward network (FFN) for ETA, FFN with Multi-factor self-Attention (FMA-ETA)
The novel Multi-factor self-attention mechanism is proposed to deal with different category features and aggregate the information purposefully.
Experiments show FMA-ETA is competitive with state-of-the-art methods in terms of the prediction accuracy with significantly better inference speed.
arXiv Detail & Related papers (2020-06-07T08:10:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.