Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction
- URL: http://arxiv.org/abs/2406.12923v1
- Date: Fri, 14 Jun 2024 12:57:17 GMT
- Title: Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction
- Authors: Wenzhao Jiang, Jindong Han, Hao Liu, Tao Tao, Naiqiang Tan, Hui Xiong,
- Abstract summary: Rapid urbanization has significantly escalated traffic congestion, underscoring the need for advanced congestion prediction services.
We introduce a Congestion Prediction Mixture-of-Experts, CP-MoE, to address the challenges.
CP-MoE has been deployed in DiDi to improve the accuracy and reliability of the travel time estimation system.
- Score: 24.26429523848735
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Rapid urbanization has significantly escalated traffic congestion, underscoring the need for advanced congestion prediction services to bolster intelligent transportation systems. As one of the world's largest ride-hailing platforms, DiDi places great emphasis on the accuracy of congestion prediction to enhance the effectiveness and reliability of their real-time services, such as travel time estimation and route planning. Despite numerous efforts have been made on congestion prediction, most of them fall short in handling heterogeneous and dynamic spatio-temporal dependencies (e.g., periodic and non-periodic congestions), particularly in the presence of noisy and incomplete traffic data. In this paper, we introduce a Congestion Prediction Mixture-of-Experts, CP-MoE, to address the above challenges. We first propose a sparsely-gated Mixture of Adaptive Graph Learners (MAGLs) with congestion-aware inductive biases to improve the model capacity for efficiently capturing complex spatio-temporal dependencies in varying traffic scenarios. Then, we devise two specialized experts to help identify stable trends and periodic patterns within the traffic data, respectively. By cascading these experts with MAGLs, CP-MoE delivers congestion predictions in a more robust and interpretable manner. Furthermore, an ordinal regression strategy is adopted to facilitate effective collaboration among diverse experts. Extensive experiments on real-world datasets demonstrate the superiority of our proposed method compared with state-of-the-art spatio-temporal prediction models. More importantly, CP-MoE has been deployed in DiDi to improve the accuracy and reliability of the travel time estimation system.
Related papers
- Improving Traffic Flow Predictions with SGCN-LSTM: A Hybrid Model for Spatial and Temporal Dependencies [55.2480439325792]
This paper introduces the Signal-Enhanced Graph Convolutional Network Long Short Term Memory (SGCN-LSTM) model for predicting traffic speeds across road networks.
Experiments on the PEMS-BAY road network traffic dataset demonstrate the SGCN-LSTM model's effectiveness.
arXiv Detail & Related papers (2024-11-01T00:37:00Z) - DST-TransitNet: A Dynamic Spatio-Temporal Deep Learning Model for Scalable and Efficient Network-Wide Prediction of Station-Level Transit Ridership [12.6020349733674]
This paper introduces DST-TransitNet, a hybrid Deep Learning model for system-wide ridership prediction.
It is tested on Bogota's BRT system data, with three distinct social scenarios.
It outperforms state-of-the-art models in precision, efficiency and robustness.
arXiv Detail & Related papers (2024-10-19T06:59:39Z) - Physics-guided Active Sample Reweighting for Urban Flow Prediction [75.24539704456791]
Urban flow prediction is a nuanced-temporal modeling that estimates the throughput of transportation services like buses, taxis and ride-driven models.
Some recent prediction solutions bring remedies with the notion of physics-guided machine learning (PGML)
We develop a atized physics-guided network (PN), and propose a data-aware framework Physics-guided Active Sample Reweighting (P-GASR)
arXiv Detail & Related papers (2024-07-18T15:44:23Z) - ST-Mamba: Spatial-Temporal Selective State Space Model for Traffic Flow Prediction [32.44888387725925]
The proposed ST-Mamba model is first to leverage the power of spatial-temporal learning in traffic flow prediction without using graph modeling.
The proposed ST-Mamba model achieves a 61.11% improvement in computational speed and increases prediction accuracy by 0.67%.
Experiments with real-world traffic datasets demonstrate that the textsfST-Mamba model sets a new benchmark in traffic flow prediction.
arXiv Detail & Related papers (2024-04-20T03:57:57Z) - Rethinking Urban Mobility Prediction: A Super-Multivariate Time Series
Forecasting Approach [71.67506068703314]
Long-term urban mobility predictions play a crucial role in the effective management of urban facilities and services.
Traditionally, urban mobility data has been structured as videos, treating longitude and latitude as fundamental pixels.
In our research, we introduce a fresh perspective on urban mobility prediction.
Instead of oversimplifying urban mobility data as traditional video data, we regard it as a complex time series.
arXiv Detail & Related papers (2023-12-04T07:39:05Z) - A Comparative Study of Loss Functions: Traffic Predictions in Regular
and Congestion Scenarios [0.0]
We explore various loss functions inspired by heavy tail analysis and imbalanced classification problems to address this issue.
We discover that when optimizing for Mean Absolute Error (MAE), the MAE-Focal Loss function stands out as the most effective.
This research enhances deep learning models' capabilities in forecasting sudden speed changes due to congestion.
arXiv Detail & Related papers (2023-08-29T17:44:02Z) - Uncertainty Quantification for Image-based Traffic Prediction across
Cities [63.136794104678025]
Uncertainty quantification (UQ) methods provide an approach to induce probabilistic reasoning.
We investigate their application to a large-scale image-based traffic dataset spanning multiple cities.
We find that our approach can capture both temporal and spatial effects on traffic behaviour in a representative case study for the city of Moscow.
arXiv Detail & Related papers (2023-08-11T13:35:52Z) - Interpretable Machine Learning Models for Modal Split Prediction in
Transportation Systems [0.43012765978447565]
Modal split prediction in transportation networks has the potential to support network operators in managing traffic congestion and improving transit service reliability.
We focus on the problem of hourly prediction of the fraction of travelers choosing one mode of transportation over another using high-dimensional travel time data.
We employ various regularization techniques for variable selection to prevent overfitting and resolve multicollinearity issues.
arXiv Detail & Related papers (2022-03-27T02:59:00Z) - SGCN:Sparse Graph Convolution Network for Pedestrian Trajectory
Prediction [64.16212996247943]
We present a Sparse Graph Convolution Network(SGCN) for pedestrian trajectory prediction.
Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians.
visualizations indicate that our method can capture adaptive interactions between pedestrians and their effective motion tendencies.
arXiv Detail & Related papers (2021-04-04T03:17:42Z) - An Effective Dynamic Spatio-temporal Framework with Multi-Source
Information for Traffic Prediction [0.22940141855172028]
The proposed model improves the prediction precision by approximately 3-7 percent on the NYC-Taxi and NYC-Bike datasets.
The experimental results show that the proposed model improves the prediction precision by approximately 3-7 percent on the NYC-Taxi and NYC-Bike datasets.
arXiv Detail & Related papers (2020-05-08T14:23:52Z) - Spatial-Temporal Transformer Networks for Traffic Flow Forecasting [74.76852538940746]
We propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) to improve the accuracy of long-term traffic forecasting.
Specifically, we present a new variant of graph neural networks, named spatial transformer, by dynamically modeling directed spatial dependencies.
The proposed model enables fast and scalable training over a long range spatial-temporal dependencies.
arXiv Detail & Related papers (2020-01-09T10:21:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.