Incident duration prediction using a bi-level machine learning framework
with outlier removal and intra-extra joint optimisation
- URL: http://arxiv.org/abs/2205.05197v1
- Date: Tue, 10 May 2022 22:40:05 GMT
- Title: Incident duration prediction using a bi-level machine learning framework
with outlier removal and intra-extra joint optimisation
- Authors: Artur Grigorev, Adriana-Simona Mihaita, Seunghyeon Lee, Fang Chen
- Abstract summary: This paper presents a novel bi-level machine learning framework enhanced with outlier removal and intra-extra joint optimisation.
We use incident data logs to develop a binary classification prediction approach, which allows us to classify traffic incidents as short-term or long-term.
We find the optimal threshold between short-term versus long-term traffic incident duration, targeting both class balance and prediction performance.
- Score: 6.291975267775799
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Predicting the duration of traffic incidents is a challenging task due to the
stochastic nature of events. The ability to accurately predict how long
accidents will last can provide significant benefits to both end-users in their
route choice and traffic operation managers in handling of non-recurrent
traffic congestion. This paper presents a novel bi-level machine learning
framework enhanced with outlier removal and intra-extra joint optimisation for
predicting the incident duration on three heterogeneous data sets collected for
both arterial roads and motorways from Sydney, Australia and San-Francisco,
U.S.A. Firstly, we use incident data logs to develop a binary classification
prediction approach, which allows us to classify traffic incidents as
short-term or long-term. We find the optimal threshold between short-term
versus long-term traffic incident duration, targeting both class balance and
prediction performance while also comparing the binary versus multi-class
classification approaches. Secondly, for more granularity of the incident
duration prediction to the minute level, we propose a new Intra-Extra Joint
Optimisation algorithm (IEO-ML) which extends multiple baseline ML models
tested against several regression scenarios across the data sets. Final results
indicate that: a) 40-45 min is the best split threshold for identifying short
versus long-term incidents and that these incidents should be modelled
separately, b) our proposed IEO-ML approach significantly outperforms baseline
ML models in $66\%$ of all cases showcasing its great potential for accurate
incident duration prediction. Lastly, we evaluate the feature importance and
show that time, location, incident type, incident reporting source and weather
at among the top 10 critical factors which influence how long incidents will
last.
Related papers
- Predicting the duration of traffic incidents for Sydney greater metropolitan area using machine learning methods [3.3373764108905446]
This research presents a comprehensive approach to predicting the duration of traffic incidents and classifying them as short-term or long-term.
We train and evaluate a variety of advanced machine learning models including Gradient Boosted Decision Trees (GBDT), Random Forest, LightGBM, and XGBoost.
XGBoost and LightGBM outperform conventional models with XGBoost achieving the lowest RMSE of 33.7 for predicting incident duration and highest classification F1 score of 0.62 for a 30-minute duration threshold.
arXiv Detail & Related papers (2024-06-27T03:16:09Z) - Loss Shaping Constraints for Long-Term Time Series Forecasting [79.3533114027664]
We present a Constrained Learning approach for long-term time series forecasting that respects a user-defined upper bound on the loss at each time-step.
We propose a practical Primal-Dual algorithm to tackle it, and aims to demonstrate that it exhibits competitive average performance in time series benchmarks, while shaping the errors across the predicted window.
arXiv Detail & Related papers (2024-02-14T18:20:44Z) - A Bi-level Framework for Traffic Accident Duration Prediction:
Leveraging Weather and Road Condition Data within a Practical Optimum
Pipeline [0.5221459608786241]
We gathered accident duration, road conditions, and meteorological data from a database of traffic accidents to check the feasibility of a traffic accident duration pipeline.
Our binary classification random forest model distinguished between short-term and long-term effects with an 83% accuracy rate.
The SHAP value analysis identified weather conditions, wind chill and wind speed as the most influential factors in determining the duration of an accident.
arXiv Detail & Related papers (2023-11-01T16:33:37Z) - Test-Time Compensated Representation Learning for Extreme Traffic
Forecasting [13.859278899032846]
congestion and rush hours can result in low correlation in vehicle speeds at various intersections during adjacent time periods.
Existing methods generally predict future series based on recent and entirely decomposed training data during the testing phase.
We propose a test-time representation learning framework comprising a multi-head spatial spatial transformer model.
arXiv Detail & Related papers (2023-09-16T18:46:34Z) - Towards Out-of-Distribution Sequential Event Prediction: A Causal
Treatment [72.50906475214457]
The goal of sequential event prediction is to estimate the next event based on a sequence of historical events.
In practice, the next-event prediction models are trained with sequential data collected at one time.
We propose a framework with hierarchical branching structures for learning context-specific representations.
arXiv Detail & Related papers (2022-10-24T07:54:13Z) - Accurate non-stationary short-term traffic flow prediction method [0.0]
This paper proposes a Long Short-Term Memory (LSTM) based method that can forecast short-term traffic flow precisely.
The proposed method performs favorably against other state-of-the-art methods with better performance on extreme outliers, delay effects, and trend-changing responses.
arXiv Detail & Related papers (2022-05-01T17:11:34Z) - Interval-censored Hawkes processes [82.87738318505582]
We propose a model to estimate the parameters of a Hawkes process in interval-censored settings.
We show how a non-homogeneous approximation to the Hawkes admits a tractable likelihood in the interval-censored setting.
arXiv Detail & Related papers (2021-04-16T07:29:04Z) - A model for traffic incident prediction using emergency braking data [77.34726150561087]
We address the fundamental problem of data scarcity in road traffic accident prediction by training our model on emergency braking events instead of accidents.
We present a prototype implementing a traffic incident prediction model for Germany based on emergency braking data from Mercedes-Benz vehicles.
arXiv Detail & Related papers (2021-02-12T18:17:12Z) - From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting [54.273455592965355]
Uncertainty in future trajectories stems from two sources: (a) sources known to the agent but unknown to the model, such as long term goals and (b)sources that are unknown to both the agent & the model, such as intent of other agents & irreducible randomness indecisions.
We model the epistemic un-certainty through multimodality in long term goals and the aleatoric uncertainty through multimodality in waypoints& paths.
To exemplify this dichotomy, we also propose a novel long term trajectory forecasting setting, with prediction horizons upto a minute, an order of magnitude longer than prior works.
arXiv Detail & Related papers (2020-12-02T21:01:29Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Long-Short Term Spatiotemporal Tensor Prediction for Passenger Flow
Profile [15.875569404476495]
We focus on a tensor-based prediction and propose several practical techniques to improve prediction.
For long-term prediction specifically, we propose the "Tensor Decomposition + 2-Dimensional Auto-Regressive Moving Average (2D-ARMA)" model.
For short-term prediction, we propose to conduct tensor completion based on tensor clustering to avoid oversimplifying and ensure accuracy.
arXiv Detail & Related papers (2020-04-23T08:30:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.