The Weather Paradox: Why Precipitation Fails to Predict Traffic Accident Severity in Large-Scale US Data
- URL: http://arxiv.org/abs/2601.00152v1
- Date: Thu, 01 Jan 2026 01:03:01 GMT
- Title: The Weather Paradox: Why Precipitation Fails to Predict Traffic Accident Severity in Large-Scale US Data
- Authors: Yann Bellec, Rohan Kaman, Siwen Cui, Aarav Agrawal, Calvin Chen,
- Abstract summary: This study investigates the predictive capacity of environmental, temporal, and spatial factors on traffic accident severity in the United States.<n>Using a dataset of 500,000 U.S. traffic accidents spanning 2016-2023, we trained an XGBoost classifier optimized through randomized search cross-validation and adjusted for class imbalance via class weighting.<n>The final model achieves an overall accuracy of 78%, with strong performance on the majority class (Severity 2), attaining 87% precision and recall.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This study investigates the predictive capacity of environmental, temporal, and spatial factors on traffic accident severity in the United States. Using a dataset of 500,000 U.S. traffic accidents spanning 2016-2023, we trained an XGBoost classifier optimized through randomized search cross-validation and adjusted for class imbalance via class weighting. The final model achieves an overall accuracy of 78%, with strong performance on the majority class (Severity 2), attaining 87% precision and recall. Feature importance analysis reveals that time of day, geographic location, and weather-related variables, including visibility, temperature, and wind speed, rank among the strongest predictors of accident severity. However, contrary to initial hypotheses, precipitation and visibility demonstrate limited predictive power, potentially reflecting behavioral adaptation by drivers under overtly hazardous conditions. The dataset's predominance of mid-level severity accidents constrains the model's capacity to learn meaningful patterns for extreme cases, highlighting the need for alternative sampling strategies, enhanced feature engineering, and integration of external datasets. These findings contribute to evidence-based traffic management and suggest future directions for severity prediction research.
Related papers
- Forecasting Fails: Unveiling Evasion Attacks in Weather Prediction Models [60.728124907335]
This work introduces Weather Adaptive Adversarial Perturbation Optimization (WAAPO), a novel framework for generating targeted adversarial perturbations.<n>WAAPO achieves this by incorporating constraints for channel sparsity, spatial localization, and smoothness, ensuring that perturbations remain physically realistic and imperceptible.<n>Our experiments highlight critical vulnerabilities in AI-driven forecasting models, where small perturbations to initial conditions can result in significant deviations.
arXiv Detail & Related papers (2025-12-09T17:20:56Z) - Predicting and Explaining Traffic Crash Severity Through Crash Feature Selection [1.0941365324532635]
This study introduces a curated dataset of more than 3 million people involved in accidents in Ohio over six years-2022.<n>The primary contribution is a transparent and reproducible methodology that combines Automated Machine Learning (AutoML) and explainable artificial intelligence (AI) to identify and interpret key risk factors associated with severe crashes.<n>Key features spanned demographic, environmental, vehicle, human and operational categories, including location type posted speed, minimum occupant age, and pre-crash action.
arXiv Detail & Related papers (2025-08-15T14:31:26Z) - Handling Weather Uncertainty in Air Traffic Prediction through an Inverse Approach [0.0]
Adverse weather conditions, particularly convective phenomena, pose significant challenges to Air Traffic Management.<n>This study introduces a 3-D Gaussian Mixture Model to predict long lead-time flight trajectory changes.<n>The model demonstrates robust performance in forecasting reroutes up to 60 minutes.
arXiv Detail & Related papers (2025-04-07T15:42:09Z) - Predicting Subway Passenger Flows under Incident Situation with Causality [5.171258995506716]
We propose a two-stage method that separates predictions under normal conditions and the causal effects of incidents.<n>Our approach is validated using real-world data, demonstrating improved accuracy.<n>Our work can assist subway system managers in estimating passenger flow affected by incidents and enable them to take proactive measures.
arXiv Detail & Related papers (2024-12-09T12:34:13Z) - ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast [57.6987191099507]
We introduce Exloss, a novel loss function that performs asymmetric optimization and highlights extreme values to obtain accurate extreme weather forecast.
We also introduce ExBooster, which captures the uncertainty in prediction outcomes by employing multiple random samples.
Our solution can achieve state-of-the-art performance in extreme weather prediction, while maintaining the overall forecast accuracy comparable to the top medium-range forecast models.
arXiv Detail & Related papers (2024-02-02T10:34:13Z) - A Bi-level Framework for Traffic Accident Duration Prediction:
Leveraging Weather and Road Condition Data within a Practical Optimum
Pipeline [0.5221459608786241]
We gathered accident duration, road conditions, and meteorological data from a database of traffic accidents to check the feasibility of a traffic accident duration pipeline.
Our binary classification random forest model distinguished between short-term and long-term effects with an 83% accuracy rate.
The SHAP value analysis identified weather conditions, wind chill and wind speed as the most influential factors in determining the duration of an accident.
arXiv Detail & Related papers (2023-11-01T16:33:37Z) - Predicting Accident Severity: An Analysis Of Factors Affecting Accident
Severity Using Random Forest Model [0.0]
This study investigates the effectiveness of the Random Forest machine learning algorithm for predicting the severity of an accident.
The model is trained on a dataset of accident records from a large metropolitan area and evaluated using various metrics.
Results show that the Random Forest model is an effective tool for predicting accident severity with an accuracy of over 80%.
arXiv Detail & Related papers (2023-10-09T16:33:44Z) - Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling [58.456404022536425]
State of the art for physical hazard prediction from weather and climate requires expensive km-scale numerical simulations driven by coarser resolution global inputs.
Here, a generative diffusion architecture is explored for downscaling such global inputs to km-scale, as a cost-effective machine learning alternative.
The model is trained to predict 2km data from a regional weather model over Taiwan, conditioned on a 25km global reanalysis.
arXiv Detail & Related papers (2023-09-24T19:57:22Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - Uncertainty estimation of pedestrian future trajectory using Bayesian
approximation [137.00426219455116]
Under dynamic traffic scenarios, planning based on deterministic predictions is not trustworthy.
The authors propose to quantify uncertainty during forecasting using approximation which deterministic approaches fail to capture.
The effect of dropout weights and long-term prediction on future state uncertainty has been studied.
arXiv Detail & Related papers (2022-05-04T04:23:38Z) - Heterogeneous-Agent Trajectory Forecasting Incorporating Class
Uncertainty [54.88405167739227]
We present HAICU, a method for heterogeneous-agent trajectory forecasting that explicitly incorporates agents' class probabilities.
We additionally present PUP, a new challenging real-world autonomous driving dataset.
We demonstrate that incorporating class probabilities in trajectory forecasting significantly improves performance in the face of uncertainty.
arXiv Detail & Related papers (2021-04-26T10:28:34Z) - A model for traffic incident prediction using emergency braking data [77.34726150561087]
We address the fundamental problem of data scarcity in road traffic accident prediction by training our model on emergency braking events instead of accidents.
We present a prototype implementing a traffic incident prediction model for Germany based on emergency braking data from Mercedes-Benz vehicles.
arXiv Detail & Related papers (2021-02-12T18:17:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.