Related papers: CrisisBERT: a Robust Transformer for Crisis Classification and Contextual Crisis Embedding

CrisisBERT: a Robust Transformer for Crisis Classification and Contextual Crisis Embedding

URL: http://arxiv.org/abs/2005.06627v2
Date: Mon, 18 May 2020 07:58:23 GMT
Title: CrisisBERT: a Robust Transformer for Crisis Classification and Contextual Crisis Embedding
Authors: Junhua Liu, Trisha Singhal, Lucienne T.M. Blessing, Kristin L. Wood and Kwan Hui Lim
Abstract summary: We propose an end-to-end transformer-based model for two crisis classification tasks, namely crisis detection and crisis recognition. We also proposed Crisis2Vec, an attention-based, document-level contextual embedding architecture for crisis embedding.
Score: 2.7718973516070684
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Classification of crisis events, such as natural disasters, terrorist attacks and pandemics, is a crucial task to create early signals and inform relevant parties for spontaneous actions to reduce overall damage. Despite crisis such as natural disasters can be predicted by professional institutions, certain events are first signaled by civilians, such as the recent COVID-19 pandemics. Social media platforms such as Twitter often exposes firsthand signals on such crises through high volume information exchange over half a billion tweets posted daily. Prior works proposed various crisis embeddings and classification using conventional Machine Learning and Neural Network models. However, none of the works perform crisis embedding and classification using state of the art attention-based deep neural networks models, such as Transformers and document-level contextual embeddings. This work proposes CrisisBERT, an end-to-end transformer-based model for two crisis classification tasks, namely crisis detection and crisis recognition, which shows promising results across accuracy and f1 scores. The proposed model also demonstrates superior robustness over benchmark, as it shows marginal performance compromise while extending from 6 to 36 events with only 51.4% additional data points. We also proposed Crisis2Vec, an attention-based, document-level contextual embedding architecture for crisis embedding, which achieve better performance than conventional crisis embedding methods such as Word2Vec and GloVe. To the best of our knowledge, our works are first to propose using transformer-based crisis classification and document-level contextual crisis embedding in the literature.

Related papers

CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics [49.2719253711215]
This study introduces a novel approach to disaster text classification by enhancing a pre-trained Large Language Model (LLM) Our methodology involves creating a comprehensive instruction dataset from disaster-related tweets, which is then used to fine-tune an open-source LLM. This fine-tuned model can classify multiple aspects of disaster-related information simultaneously, such as the type of event, informativeness, and involvement of human aid.
arXiv Detail & Related papers (2024-06-16T23:01:10Z)
CrisisViT: A Robust Vision Transformer for Crisis Image Classification [5.14879510106258]
This paper proposes the use of state-of-the-art deep neural models for automatic image classification/tagging. We leverage the new Incidents1M crisis image dataset to develop a range of new transformer-based image classification models.
arXiv Detail & Related papers (2024-01-05T14:45:45Z)
CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster Tweet Classification [51.58605842457186]
We present a fine-grained disaster tweet classification model under the semi-supervised, few-shot learning setting. Our model, CrisisMatch, effectively classifies tweets into fine-grained classes of interest using few labeled data and large amounts of unlabeled data.
arXiv Detail & Related papers (2023-10-23T07:01:09Z)
DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet Classification via Memory Bank [52.20298962359658]
In crisis events, people often use social media platforms such as Twitter to disseminate information about the situation, warnings, advice, and support. fully-supervised approaches require annotating vast amounts of data and are impractical due to limited response time. Semi-supervised models can be biased, performing moderately well for certain classes while performing extremely poorly for others. We propose a simple but effective debiasing method, DeCrisisMB, that utilizes a Memory Bank to store and perform equal sampling for generated pseudo-labels from each class at each training.
arXiv Detail & Related papers (2023-10-23T05:25:51Z)
CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts [3.690904966341072]
Social media platforms play an essential role in crisis communication, but analyzing crisis-related social media texts is challenging due to their informal nature. This study introduces CrisisTransformers, an ensemble of pre-trained language models and sentence encoders trained on an extensive corpus of over 15 billion word tokens from tweets.
arXiv Detail & Related papers (2023-09-11T14:36:16Z)
CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization [62.77066949111921]
This paper presents CrisisLTLSum, the largest dataset of local crisis event timelines available to date. CrisisLTLSum contains 1,000 crisis event timelines across four domains: wildfires, local fires, traffic, and storms. Our initial experiments indicate a significant gap between the performance of strong baselines compared to the human performance on both tasks.
arXiv Detail & Related papers (2022-10-25T17:32:40Z)
Introducing the ICBe Dataset: Very High Recall and Precision Event Extraction from Narratives about International Crises [0.0]
We conceive of international affairs as a strategic chess game between adversaries, requiring a systematic way to measure pieces, moves, and gambits. We develop such a measurement strategy with an ontology of crisis actions and interactions and apply it to a high-quality corpus of crisis narratives recorded by the International Crisis Behavior (ICB) Project. We introduce a new crisis event dataset ICB Events (ICBe). We find that ICBe captures the process of a crisis with greater accuracy and granularity than other well-regarded events or crisis datasets.
arXiv Detail & Related papers (2022-02-14T23:03:52Z)
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management [0.5249805590164902]
Social media such as Twitter provide valuable information to crisis managers and affected people during natural disasters. Machine learning can help structure and extract information from the large volume of messages shared during a crisis. We show that BERT embeddings finetuned on crisis-related tweet classification can effectively be used to adapt to a new crisis.
arXiv Detail & Related papers (2021-03-22T13:30:39Z)
Event-Related Bias Removal for Real-time Disaster Events [67.2965372987723]
Social media has become an important tool to share information about crisis events such as natural disasters and mass attacks. Detecting actionable posts that contain useful information requires rapid analysis of huge volume of data in real-time. We train an adversarial neural model to remove latent event-specific biases and improve the performance on tweet importance classification.
arXiv Detail & Related papers (2020-11-02T02:03:07Z)
Clustering of Social Media Messages for Humanitarian Aid Response during Crisis [47.187609203210705]
We show that recent advances in Deep Learning and Natural Language Processing outperform prior approaches for the task of classifying informativeness. We extend these methods to two sub-tasks of informativeness and find that the Deep Learning methods are effective here as well.
arXiv Detail & Related papers (2020-07-23T02:18:05Z)

This list is automatically generated from the titles and abstracts of the papers in this site.