CrashFormer: A Multimodal Architecture to Predict the Risk of Crash
- URL: http://arxiv.org/abs/2402.05151v1
- Date: Wed, 7 Feb 2024 13:09:23 GMT
- Title: CrashFormer: A Multimodal Architecture to Predict the Risk of Crash
- Authors: Amin Karimi Monsefi, Pouya Shiri, Ahmad Mohammadshirazi, Nastaran
Karimi Monsefi, Ron Davies, Sobhan Moosavi and Rajiv Ramnath
- Abstract summary: Accident prediction is key to improving traffic safety, enabling proactive measures to be taken before a crash occurs.
We propose CrashFormer, a multi-modal architecture that utilizes comprehensive inputs such as the history of accidents, weather information, map images, and demographic information.
The model predicts the future risk of accidents on a reasonably acceptable cadence (i.e. every six hours) for a geographical location of 5.161 square kilometers.
- Score: 1.3194391758295112
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Reducing traffic accidents is a crucial global public safety concern.
Accident prediction is key to improving traffic safety, enabling proactive
measures to be taken before a crash occurs, and informing safety policies,
regulations, and targeted interventions. Despite numerous studies on accident
prediction over the past decades, many have limitations in terms of
generalizability, reproducibility, or feasibility for practical use due to
input data or problem formulation. To address existing shortcomings, we propose
CrashFormer, a multi-modal architecture that utilizes comprehensive (but
relatively easy to obtain) inputs such as the history of accidents, weather
information, map images, and demographic information. The model predicts the
future risk of accidents on a reasonably acceptable cadence (i.e., every six
hours) for a geographical location of 5.161 square kilometers. CrashFormer is
composed of five components: a sequential encoder to utilize historical
accidents and weather data, an image encoder to use map imagery data, a raw
data encoder to utilize demographic information, a feature fusion module for
aggregating the encoded features, and a classifier that accepts the aggregated
data and makes predictions accordingly. Results from extensive real-world
experiments in 10 major US cities show that CrashFormer outperforms
state-of-the-art sequential and non-sequential models by 1.8% in F1-score on
average when using ``sparse'' input data.
Related papers
- Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses [76.59021017301127]
We propose a large-scale traffic crash language dataset, named CrashEvent, summarizing 19,340 real-world crash reports.
We further formulate the crash event feature learning as a novel text reasoning problem and further fine-tune various large language models (LLMs) to predict detailed accident outcomes.
Our experiments results show that our LLM-based approach not only predicts the severity of accidents but also classifies different types of accidents and predicts injury outcomes.
arXiv Detail & Related papers (2024-06-16T03:10:16Z) - CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster
Tweet Classification [51.58605842457186]
We present a fine-grained disaster tweet classification model under the semi-supervised, few-shot learning setting.
Our model, CrisisMatch, effectively classifies tweets into fine-grained classes of interest using few labeled data and large amounts of unlabeled data.
arXiv Detail & Related papers (2023-10-23T07:01:09Z) - DeepAccident: A Motion and Accident Prediction Benchmark for V2X
Autonomous Driving [76.29141888408265]
We propose a large-scale dataset containing diverse accident scenarios that frequently occur in real-world driving.
The proposed DeepAccident dataset includes 57K annotated frames and 285K annotated samples, approximately 7 times more than the large-scale nuScenes dataset.
arXiv Detail & Related papers (2023-04-03T17:37:00Z) - Cognitive Accident Prediction in Driving Scenes: A Multimodality
Benchmark [77.54411007883962]
We propose a Cognitive Accident Prediction (CAP) method that explicitly leverages human-inspired cognition of text description on the visual observation and the driver attention to facilitate model training.
CAP is formulated by an attentive text-to-vision shift fusion module, an attentive scene context transfer module, and the driver attention guided accident prediction module.
We construct a new large-scale benchmark consisting of 11,727 in-the-wild accident videos with over 2.19 million frames.
arXiv Detail & Related papers (2022-12-19T11:43:02Z) - Predicting Seriousness of Injury in a Traffic Accident: A New Imbalanced
Dataset and Benchmark [62.997667081978825]
The paper introduces a new dataset to assess the performance of machine learning algorithms in the prediction of the seriousness of injury in a traffic accident.
The dataset is created by aggregating publicly available datasets from the UK Department for Transport.
arXiv Detail & Related papers (2022-05-20T21:15:26Z) - Crash Report Data Analysis for Creating Scenario-Wise, Spatio-Temporal
Attention Guidance to Support Computer Vision-based Perception of Fatal Crash
Risks [8.34084323253809]
This paper develops a data analytics model, named scenario-wise, Spatio-temporal attention guidance, from fatal crash report data.
It estimates the relevance of detected objects to fatal crashes from their environment and context information.
The paper shows how the developed attention guidance supports the design and implementation of a preliminary CV model.
arXiv Detail & Related papers (2021-09-06T19:43:37Z) - A model for traffic incident prediction using emergency braking data [77.34726150561087]
We address the fundamental problem of data scarcity in road traffic accident prediction by training our model on emergency braking events instead of accidents.
We present a prototype implementing a traffic incident prediction model for Germany based on emergency braking data from Mercedes-Benz vehicles.
arXiv Detail & Related papers (2021-02-12T18:17:12Z) - Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal
Relational Learning [30.59728753059457]
Traffic accident anticipation aims to predict accidents from dashcam videos as early as possible.
Current deterministic deep neural networks could be overconfident in false predictions.
We propose an uncertainty-based accident anticipation model with relational-temporal learning.
arXiv Detail & Related papers (2020-08-01T20:21:48Z) - Global Feature Aggregation for Accident Anticipation [8.57961305383434]
We propose a novel Feature Aggregation (FA) block that refines each object's features by computing a weighted sum of the features of all objects in a frame.
We use FA block along with Long Short Term Memory (LSTM) network to anticipate accidents in the video sequences.
arXiv Detail & Related papers (2020-06-16T06:17:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.