Causal Analysis and Classification of Traffic Crash Injury Severity
Using Machine Learning Algorithms
- URL: http://arxiv.org/abs/2112.03407v1
- Date: Tue, 30 Nov 2021 20:32:31 GMT
- Title: Causal Analysis and Classification of Traffic Crash Injury Severity
Using Machine Learning Algorithms
- Authors: Meghna Chakraborty, Timothy Gates, Subhrajit Sinha
- Abstract summary: The data used in this study were obtained for traffic crashes on all interstates across the state of Texas from a period of six years between 2014 and 2019.
The output of the proposed severity classification approach includes three classes for fatal and severe injury (KA) crashes, non-severe and possible injury (BC) crashes, and property damage only (PDO) crashes.
The results of Granger causality analysis identified the speed limit, surface and weather conditions, traffic volume, presence of workzones, workers in workzones, and high occupancy vehicle (HOV) lanes, as the most important factors affecting crash severity
- Score: 0.0
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Causal analysis and classification of injury severity applying non-parametric
methods for traffic crashes has received limited attention. This study presents
a methodological framework for causal inference, using Granger causality
analysis, and injury severity classification of traffic crashes, occurring on
interstates, with different machine learning techniques including decision
trees (DT), random forest (RF), extreme gradient boosting (XGBoost), and deep
neural network (DNN). The data used in this study were obtained for traffic
crashes on all interstates across the state of Texas from a period of six years
between 2014 and 2019. The output of the proposed severity classification
approach includes three classes for fatal and severe injury (KA) crashes,
non-severe and possible injury (BC) crashes, and property damage only (PDO)
crashes. While Granger Causality helped identify the most influential factors
affecting crash severity, the learning-based models predicted the severity
classes with varying performance. The results of Granger causality analysis
identified the speed limit, surface and weather conditions, traffic volume,
presence of workzones, workers in workzones, and high occupancy vehicle (HOV)
lanes, among others, as the most important factors affecting crash severity.
The prediction performance of the classifiers yielded varying results across
the different classes. Specifically, while decision tree and random forest
classifiers provided the greatest performance for PDO and BC severities,
respectively, for the KA class, the rarest class in the data, deep neural net
classifier performed superior than all other algorithms, most likely due to its
capability of approximating nonlinear models. This study contributes to the
limited body of knowledge pertaining to causal analysis and classification
prediction of traffic crash injury severity using non-parametric approaches.
Related papers
- Indiscriminate Disruption of Conditional Inference on Multivariate Gaussians [60.22542847840578]
Despite advances in adversarial machine learning, inference for Gaussian models in the presence of an adversary is notably understudied.
We consider a self-interested attacker who wishes to disrupt a decisionmaker's conditional inference and subsequent actions by corrupting a set of evidentiary variables.
To avoid detection, the attacker also desires the attack to appear plausible wherein plausibility is determined by the density of the corrupted evidence.
arXiv Detail & Related papers (2024-11-21T17:46:55Z) - An Explainable Machine Learning Approach to Traffic Accident Fatality Prediction [0.02730969268472861]
Road traffic accidents pose a significant public health threat worldwide.
This study presents a machine learning-based approach for classifying fatal and non-fatal road accident outcomes.
arXiv Detail & Related papers (2024-09-18T12:41:56Z) - Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses [76.59021017301127]
We propose a large-scale traffic crash language dataset, named CrashEvent, summarizing 19,340 real-world crash reports.
We further formulate the crash event feature learning as a novel text reasoning problem and further fine-tune various large language models (LLMs) to predict detailed accident outcomes.
Our experiments results show that our LLM-based approach not only predicts the severity of accidents but also classifies different types of accidents and predicts injury outcomes.
arXiv Detail & Related papers (2024-06-16T03:10:16Z) - Inferring Heterogeneous Treatment Effects of Crashes on Highway Traffic:
A Doubly Robust Causal Machine Learning Approach [15.717402981513812]
This paper proposes a novel causal machine learning framework to estimate the causal effect of different types of crashes on highway speed.
Experimental results from 4815 crashes on Highway Interstate 5 in Washington State reveal the heterogeneous treatment effects of crashes at varying distances and durations.
arXiv Detail & Related papers (2024-01-01T15:03:14Z) - Prediction of Crash Injury Severity in Florida's Interstate-95 [0.0]
Traffic crashes on Florida's Interstate-95 from 2016 to 2021 were gathered.
classification methods were used to estimate the severity of driver injuries.
The Adaboost algorithm outperformed the others in terms of recall and AUC.
arXiv Detail & Related papers (2023-12-16T18:42:39Z) - Exploring Machine Learning Techniques to Identify Important Factors
Leading to Injury in Curve Related Crashes [0.4129225533930965]
This study tries to eliminate shortcomings by considering important pre-crash events related factors as selected variables and the number of vehicles with or without injury as a predicted variable.
This research used CRSS data from the National Highway Traffic Safety Administration (NHTSA), which includes traffic crash-related data for different states in the USA.
Analysis results revealed that the extent of the damage, critical pre-crash event, pre-impact location, the trafficway description, roadway surface condition, the month of the crash, the first harmful event, number of motor vehicles, attempted avoidance maneuver, and roadway grade affect the number of vehicles with or
arXiv Detail & Related papers (2023-01-04T13:07:28Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Anomaly Detection in Cybersecurity: Unsupervised, Graph-Based and
Supervised Learning Methods in Adversarial Environments [63.942632088208505]
Inherent to today's operating environment is the practice of adversarial machine learning.
In this work, we examine the feasibility of unsupervised learning and graph-based methods for anomaly detection.
We incorporate a realistic adversarial training mechanism when training our supervised models to enable strong classification performance in adversarial environments.
arXiv Detail & Related papers (2021-05-14T10:05:10Z) - A model for traffic incident prediction using emergency braking data [77.34726150561087]
We address the fundamental problem of data scarcity in road traffic accident prediction by training our model on emergency braking events instead of accidents.
We present a prototype implementing a traffic incident prediction model for Germany based on emergency braking data from Mercedes-Benz vehicles.
arXiv Detail & Related papers (2021-02-12T18:17:12Z) - Comparison Analysis of Tree Based and Ensembled Regression Algorithms
for Traffic Accident Severity Prediction [2.956978593944786]
Various machine learning models are being used for accident prediction.
Random Forest as the best performing model with highest classification with 0.974 accuracy, 0.954 precision, 0.930 recall and 0.942 F-score.
arXiv Detail & Related papers (2020-10-27T11:52:39Z) - Influence Functions in Deep Learning Are Fragile [52.31375893260445]
influence functions approximate the effect of samples in test-time predictions.
influence estimates are fairly accurate for shallow networks.
Hessian regularization is important to get highquality influence estimates.
arXiv Detail & Related papers (2020-06-25T18:25:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.