CrisisBERT: a Robust Transformer for Crisis Classification and
Contextual Crisis Embedding
- URL: http://arxiv.org/abs/2005.06627v2
- Date: Mon, 18 May 2020 07:58:23 GMT
- Title: CrisisBERT: a Robust Transformer for Crisis Classification and
Contextual Crisis Embedding
- Authors: Junhua Liu, Trisha Singhal, Lucienne T.M. Blessing, Kristin L. Wood
and Kwan Hui Lim
- Abstract summary: We propose an end-to-end transformer-based model for two crisis classification tasks, namely crisis detection and crisis recognition.
We also proposed Crisis2Vec, an attention-based, document-level contextual embedding architecture for crisis embedding.
- Score: 2.7718973516070684
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Classification of crisis events, such as natural disasters, terrorist attacks
and pandemics, is a crucial task to create early signals and inform relevant
parties for spontaneous actions to reduce overall damage. Despite crisis such
as natural disasters can be predicted by professional institutions, certain
events are first signaled by civilians, such as the recent COVID-19 pandemics.
Social media platforms such as Twitter often exposes firsthand signals on such
crises through high volume information exchange over half a billion tweets
posted daily. Prior works proposed various crisis embeddings and classification
using conventional Machine Learning and Neural Network models. However, none of
the works perform crisis embedding and classification using state of the art
attention-based deep neural networks models, such as Transformers and
document-level contextual embeddings. This work proposes CrisisBERT, an
end-to-end transformer-based model for two crisis classification tasks, namely
crisis detection and crisis recognition, which shows promising results across
accuracy and f1 scores. The proposed model also demonstrates superior
robustness over benchmark, as it shows marginal performance compromise while
extending from 6 to 36 events with only 51.4% additional data points. We also
proposed Crisis2Vec, an attention-based, document-level contextual embedding
architecture for crisis embedding, which achieve better performance than
conventional crisis embedding methods such as Word2Vec and GloVe. To the best
of our knowledge, our works are first to propose using transformer-based crisis
classification and document-level contextual crisis embedding in the
literature.
Related papers
- CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics [49.2719253711215]
This study introduces a novel approach to disaster text classification by enhancing a pre-trained Large Language Model (LLM)
Our methodology involves creating a comprehensive instruction dataset from disaster-related tweets, which is then used to fine-tune an open-source LLM.
This fine-tuned model can classify multiple aspects of disaster-related information simultaneously, such as the type of event, informativeness, and involvement of human aid.
arXiv Detail & Related papers (2024-06-16T23:01:10Z) - CrisisViT: A Robust Vision Transformer for Crisis Image Classification [5.14879510106258]
This paper proposes the use of state-of-the-art deep neural models for automatic image classification/tagging.
We leverage the new Incidents1M crisis image dataset to develop a range of new transformer-based image classification models.
arXiv Detail & Related papers (2024-01-05T14:45:45Z) - CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster
Tweet Classification [51.58605842457186]
We present a fine-grained disaster tweet classification model under the semi-supervised, few-shot learning setting.
Our model, CrisisMatch, effectively classifies tweets into fine-grained classes of interest using few labeled data and large amounts of unlabeled data.
arXiv Detail & Related papers (2023-10-23T07:01:09Z) - DeCrisisMB: Debiased Semi-Supervised Learning for Crisis Tweet
Classification via Memory Bank [52.20298962359658]
In crisis events, people often use social media platforms such as Twitter to disseminate information about the situation, warnings, advice, and support.
fully-supervised approaches require annotating vast amounts of data and are impractical due to limited response time.
Semi-supervised models can be biased, performing moderately well for certain classes while performing extremely poorly for others.
We propose a simple but effective debiasing method, DeCrisisMB, that utilizes a Memory Bank to store and perform equal sampling for generated pseudo-labels from each class at each training.
arXiv Detail & Related papers (2023-10-23T05:25:51Z) - CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts [3.690904966341072]
Social media platforms play an essential role in crisis communication, but analyzing crisis-related social media texts is challenging due to their informal nature.
This study introduces CrisisTransformers, an ensemble of pre-trained language models and sentence encoders trained on an extensive corpus of over 15 billion word tokens from tweets.
arXiv Detail & Related papers (2023-09-11T14:36:16Z) - CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and
Summarization [62.77066949111921]
This paper presents CrisisLTLSum, the largest dataset of local crisis event timelines available to date.
CrisisLTLSum contains 1,000 crisis event timelines across four domains: wildfires, local fires, traffic, and storms.
Our initial experiments indicate a significant gap between the performance of strong baselines compared to the human performance on both tasks.
arXiv Detail & Related papers (2022-10-25T17:32:40Z) - Introducing the ICBe Dataset: Very High Recall and Precision Event
Extraction from Narratives about International Crises [0.0]
We conceive of international affairs as a strategic chess game between adversaries, requiring a systematic way to measure pieces, moves, and gambits.
We develop such a measurement strategy with an ontology of crisis actions and interactions and apply it to a high-quality corpus of crisis narratives recorded by the International Crisis Behavior (ICB) Project.
We introduce a new crisis event dataset ICB Events (ICBe). We find that ICBe captures the process of a crisis with greater accuracy and granularity than other well-regarded events or crisis datasets.
arXiv Detail & Related papers (2022-02-14T23:03:52Z) - Bridging the gap between supervised classification and unsupervised
topic modelling for social-media assisted crisis management [0.5249805590164902]
Social media such as Twitter provide valuable information to crisis managers and affected people during natural disasters.
Machine learning can help structure and extract information from the large volume of messages shared during a crisis.
We show that BERT embeddings finetuned on crisis-related tweet classification can effectively be used to adapt to a new crisis.
arXiv Detail & Related papers (2021-03-22T13:30:39Z) - Event-Related Bias Removal for Real-time Disaster Events [67.2965372987723]
Social media has become an important tool to share information about crisis events such as natural disasters and mass attacks.
Detecting actionable posts that contain useful information requires rapid analysis of huge volume of data in real-time.
We train an adversarial neural model to remove latent event-specific biases and improve the performance on tweet importance classification.
arXiv Detail & Related papers (2020-11-02T02:03:07Z) - Clustering of Social Media Messages for Humanitarian Aid Response during
Crisis [47.187609203210705]
We show that recent advances in Deep Learning and Natural Language Processing outperform prior approaches for the task of classifying informativeness.
We extend these methods to two sub-tasks of informativeness and find that the Deep Learning methods are effective here as well.
arXiv Detail & Related papers (2020-07-23T02:18:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.