Topic Modeling Analysis of Aviation Accident Reports: A Comparative
Study between LDA and NMF Models
- URL: http://arxiv.org/abs/2403.04788v1
- Date: Mon, 4 Mar 2024 01:41:07 GMT
- Title: Topic Modeling Analysis of Aviation Accident Reports: A Comparative
Study between LDA and NMF Models
- Authors: Aziida Nanyonga, Hassan Wasswa and Graham Wild
- Abstract summary: This paper compares two prominent topic modeling techniques, Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF)
LDA demonstrates higher topic coherence, indicating stronger semantic relevance among words within topics.
NMF excelled in producing distinct and granular topics, enabling a more focused analysis of specific aspects of aviation accidents.
- Score: 0.0
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Aviation safety is paramount in the modern world, with a continuous
commitment to reducing accidents and improving safety standards. Central to
this endeavor is the analysis of aviation accident reports, rich textual
resources that hold insights into the causes and contributing factors behind
aviation mishaps. This paper compares two prominent topic modeling techniques,
Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF),
in the context of aviation accident report analysis. The study leverages the
National Transportation Safety Board (NTSB) Dataset with the primary objective
of automating and streamlining the process of identifying latent themes and
patterns within accident reports. The Coherence Value (C_v) metric was used to
evaluate the quality of generated topics. LDA demonstrates higher topic
coherence, indicating stronger semantic relevance among words within topics. At
the same time, NMF excelled in producing distinct and granular topics, enabling
a more focused analysis of specific aspects of aviation accidents.
Related papers
- Computational Safety for Generative AI: A Signal Processing Perspective [65.268245109828]
computational safety is a mathematical framework that enables the quantitative assessment, formulation, and study of safety challenges in GenAI.
We show how sensitivity analysis and loss landscape analysis can be used to detect malicious prompts with jailbreak attempts.
We discuss key open research challenges, opportunities, and the essential role of signal processing in computational AI safety.
arXiv Detail & Related papers (2025-02-18T02:26:50Z) - Exploring Aviation Incident Narratives Using Topic Modeling and Clustering Techniques [0.0]
This study applies advanced natural language processing (NLP) techniques to the National Transportation Safety Board (NTSB) dataset.
Main objectives are identifying latent themes, exploring semantic relationships, assessing probabilistic connections, and cluster incidents based on shared characteristics.
Comparative analysis reveals that LDA performed best with a coherence value of 0.597, pLSA of 0.583, LSA of 0.542, and NMF of 0.437.
arXiv Detail & Related papers (2025-01-14T08:23:15Z) - Natural Language Processing and Deep Learning Models to Classify Phase of Flight in Aviation Safety Occurrences [14.379311972506791]
Researchers applied natural language processing (NLP) and artificial intelligence (AI) models to process text narratives to classify the flight phases of safety occurrences.
The classification performance of two deep learning models, ResNet and sRNN was evaluated, using an initial dataset of 27,000 safety occurrence reports from the NTSB.
arXiv Detail & Related papers (2025-01-11T15:02:49Z) - Analyzing Aviation Safety Narratives with LDA, NMF and PLSA: A Case Study Using Socrata Datasets [0.0]
This study explores the application of topic modelling techniques on the Socrata dataset spanning from 1908 to 2009.
The analysis identified key themes such as pilot error, mechanical failure, weather conditions, and training deficiencies.
Future directions include integrating additional contextual variables, leveraging neural topic models, and enhancing aviation safety protocols.
arXiv Detail & Related papers (2025-01-03T08:14:39Z) - Comparative Analysis of Topic Modeling Techniques on ATSB Text Narratives Using Natural Language Processing [0.0]
This paper explores the application of four prominent topic modelling techniques, namely Probabilistic Latent Semantic Analysis (pLSA), Latent Semantic Analysis (LSA), Latent Dirichlet Allocation (LDA), and Non-negative Matrix Factorization (NMF)
The study examines each technique's ability to unveil latent thematic structures within the data, providing safety professionals with a systematic approach to gain actionable insights.
arXiv Detail & Related papers (2025-01-02T12:21:07Z) - Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses [76.59021017301127]
We propose a large-scale traffic crash language dataset, named CrashEvent, summarizing 19,340 real-world crash reports.
We further formulate the crash event feature learning as a novel text reasoning problem and further fine-tune various large language models (LLMs) to predict detailed accident outcomes.
Our experiments results show that our LLM-based approach not only predicts the severity of accidents but also classifies different types of accidents and predicts injury outcomes.
arXiv Detail & Related papers (2024-06-16T03:10:16Z) - DRUformer: Enhancing the driving scene Important object detection with
driving relationship self-understanding [50.81809690183755]
Traffic accidents frequently lead to fatal injuries, contributing to over 50 million deaths until 2023.
Previous research primarily assessed the importance of individual participants, treating them as independent entities.
We introduce Driving scene Relationship self-Understanding transformer (DRUformer) to enhance the important object detection task.
arXiv Detail & Related papers (2023-11-11T07:26:47Z) - Aviation Safety Risk Analysis and Flight Technology Assessment Issues [0.0]
It focuses on two main areas: analyzing exceedance events and statistically evaluating non-exceedance data.
The proposed solutions involve data preprocessing, reliability assessment, quantifying flight control using neural networks, exploratory data analysis, and establishing real-time automated warnings.
arXiv Detail & Related papers (2023-08-10T14:13:49Z) - A Counterfactual Safety Margin Perspective on the Scoring of Autonomous
Vehicles' Riskiness [52.27309191283943]
This paper presents a data-driven framework for assessing the risk of different AVs' behaviors.
We propose the notion of counterfactual safety margin, which represents the minimum deviation from nominal behavior that could cause a collision.
arXiv Detail & Related papers (2023-08-02T09:48:08Z) - Are Neural Topic Models Broken? [81.15470302729638]
We study the relationship between automated and human evaluation of topic models.
We find that neural topic models fare worse in both respects compared to an established classical method.
arXiv Detail & Related papers (2022-10-28T14:38:50Z) - A model for traffic incident prediction using emergency braking data [77.34726150561087]
We address the fundamental problem of data scarcity in road traffic accident prediction by training our model on emergency braking events instead of accidents.
We present a prototype implementing a traffic incident prediction model for Germany based on emergency braking data from Mercedes-Benz vehicles.
arXiv Detail & Related papers (2021-02-12T18:17:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.