Natural Language Processing of Aviation Occurrence Reports for Safety
Management
- URL: http://arxiv.org/abs/2301.05663v1
- Date: Fri, 13 Jan 2023 17:00:09 GMT
- Title: Natural Language Processing of Aviation Occurrence Reports for Safety
Management
- Authors: Patrick Jonk, Vincent de Vries, Rombout Wever, Georgios Sidiropoulos,
Evangelos Kanoulas
- Abstract summary: This paper explores various Natural Language Processing (NLP) methods to support the analysis of aviation safety occurrence reports.
Under the right conditions the labeling of occurrence reports can be effectively automated with a transformer-based classifier.
- Score: 7.008490462870145
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Occurrence reporting is a commonly used method in safety management systems
to obtain insight in the prevalence of hazards and accident scenarios. In
support of safety data analysis, reports are often categorized according to a
taxonomy. However, the processing of the reports can require significant effort
from safety analysts and a common problem is interrater variability in labeling
processes. Also, in some cases, reports are not processed according to a
taxonomy, or the taxonomy does not fully cover the contents of the documents.
This paper explores various Natural Language Processing (NLP) methods to
support the analysis of aviation safety occurrence reports. In particular, the
problems studied are the automatic labeling of reports using a classification
model, extracting the latent topics in a collection of texts using a topic
model and the automatic generation of probable cause texts. Experimental
results showed that (i) under the right conditions the labeling of occurrence
reports can be effectively automated with a transformer-based classifier, (ii)
topic modeling can be useful for finding the topics present in a collection of
reports, and (iii) using a summarization model can be a promising direction for
generating probable cause texts.
Related papers
- Intelligent Multi-Document Summarisation for Extracting Insights on Racial Inequalities from Maternity Incident Investigation Reports [0.8609957371651683]
In healthcare, thousands of safety incidents occur every year, but learning from these incidents is not effectively aggregated.
This paper presents I-SIRch:CS, a framework designed to facilitate the aggregation and analysis of safety incident reports.
The framework integrates concept annotation using the Safety Intelligence Research (SIRch) taxonomy with clustering, summarisation, and analysis capabilities.
arXiv Detail & Related papers (2024-07-11T09:11:20Z) - Hierarchical Multi-label Classification for Fine-level Event Extraction from Aviation Accident Reports [18.005377921658308]
This article argues that we can identify the events more accurately by leveraging the event taxonomy.
We achieve this hierarchical classification task by incorporating a novel hierarchical attention module into BERT.
It has been shown that fine-level prediction accuracy is highly improved, and the regularization term can be beneficial to the rare event identification problem.
arXiv Detail & Related papers (2024-03-26T17:51:06Z) - AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation [57.8363998797433]
We propose AMRFact, a framework that generates perturbed summaries using Abstract Meaning Representations (AMRs)
Our approach parses factually consistent summaries into AMR graphs and injects controlled factual inconsistencies to create negative examples, allowing for coherent factually inconsistent summaries to be generated with high error-type coverage.
arXiv Detail & Related papers (2023-11-16T02:56:29Z) - Assessing Privacy Risks in Language Models: A Case Study on
Summarization Tasks [65.21536453075275]
We focus on the summarization task and investigate the membership inference (MI) attack.
We exploit text similarity and the model's resistance to document modifications as potential MI signals.
We discuss several safeguards for training summarization models to protect against MI attacks and discuss the inherent trade-off between privacy and utility.
arXiv Detail & Related papers (2023-10-20T05:44:39Z) - AGIR: Automating Cyber Threat Intelligence Reporting with Natural
Language Generation [15.43868945929965]
We introduce AGIR (Automatic Generation of Intelligence Reports), a transformative tool for CTI reporting.
AGIR's primary objective is to empower security analysts by automating the labor-intensive task of generating comprehensive intelligence reports.
We evaluate AGIR's report generation capabilities both quantitatively and qualitatively.
arXiv Detail & Related papers (2023-10-04T08:25:37Z) - MQAG: Multiple-choice Question Answering and Generation for Assessing
Information Consistency in Summarization [55.60306377044225]
State-of-the-art summarization systems can generate highly fluent summaries.
These summaries, however, may contain factual inconsistencies and/or information not present in the source.
We introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared.
arXiv Detail & Related papers (2023-01-28T23:08:25Z) - Relational Action Bases: Formalization, Effective Safety Verification,
and Invariants (Extended Version) [67.99023219822564]
We introduce the general framework of relational action bases (RABs)
RABs generalize existing models by lifting both restrictions.
We demonstrate the effectiveness of this approach on a benchmark of data-aware business processes.
arXiv Detail & Related papers (2022-08-12T17:03:50Z) - Relation Clustering in Narrative Knowledge Graphs [71.98234178455398]
relational sentences in the original text are embedded (with SBERT) and clustered in order to merge together semantically similar relations.
Preliminary tests show that such clustering might successfully detect similar relations, and provide a valuable preprocessing for semi-supervised approaches.
arXiv Detail & Related papers (2020-11-27T10:43:04Z) - Predicting Themes within Complex Unstructured Texts: A Case Study on
Safeguarding Reports [66.39150945184683]
We focus on the problem of automatically identifying the main themes in a safeguarding report using supervised classification approaches.
Our results show the potential of deep learning models to simulate subject-expert behaviour even for complex tasks with limited labelled data.
arXiv Detail & Related papers (2020-10-27T19:48:23Z) - Tense, aspect and mood based event extraction for situation analysis and
crisis management [0.0]
This thesis develops such a system for Turkish language.
It can, in addition to extracting basic event structures, classify sentences given in news reports according to their temporal, modal and volitional/illocutionary values.
arXiv Detail & Related papers (2020-08-01T19:22:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.