Weakly-supervised Fine-grained Event Recognition on Social Media Texts
for Disaster Management
- URL: http://arxiv.org/abs/2010.01683v1
- Date: Sun, 4 Oct 2020 21:06:45 GMT
- Title: Weakly-supervised Fine-grained Event Recognition on Social Media Texts
for Disaster Management
- Authors: Wenlin Yao, Cheng Zhang, Shiva Saravanan, Ruihong Huang, Ali Mostafavi
- Abstract summary: People increasingly use social media to report, seek help or share information during disasters.
We present a weakly supervised approach for building high-quality classifiers that label each individual Twitter message with fine-grained event categories.
- Score: 27.811526979673484
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: People increasingly use social media to report emergencies, seek help or
share information during disasters, which makes social networks an important
tool for disaster management. To meet these time-critical needs, we present a
weakly supervised approach for rapidly building high-quality classifiers that
label each individual Twitter message with fine-grained event categories. Most
importantly, we propose a novel method to create high-quality labeled data in a
timely manner that automatically clusters tweets containing an event keyword
and asks a domain expert to disambiguate event word senses and label clusters
quickly. In addition, to process extremely noisy and often rather short
user-generated messages, we enrich tweet representations using preceding
context tweets and reply tweets in building event recognition classifiers. The
evaluation on two hurricanes, Harvey and Florence, shows that using only 1-2
person-hours of human supervision, the rapidly trained weakly supervised
classifiers outperform supervised classifiers trained using more than ten
thousand annotated tweets created in over 50 person-hours.
Related papers
- Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models [0.0]
We assess three commercial large language models in zero-shot classification of short social media posts.
The models are asked to perform two classification tasks: 1) identify if the post is informative in a humanitarian context; and 2) rank and provide probabilities for the post in relation to 16 possible humanitarian classes.
Results are evaluated using macro, weighted, and binary F1-scores.
arXiv Detail & Related papers (2024-09-30T19:33:58Z) - CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics [49.2719253711215]
This study introduces a novel approach to disaster text classification by enhancing a pre-trained Large Language Model (LLM)
Our methodology involves creating a comprehensive instruction dataset from disaster-related tweets, which is then used to fine-tune an open-source LLM.
This fine-tuned model can classify multiple aspects of disaster-related information simultaneously, such as the type of event, informativeness, and involvement of human aid.
arXiv Detail & Related papers (2024-06-16T23:01:10Z) - CrisisMatch: Semi-Supervised Few-Shot Learning for Fine-Grained Disaster
Tweet Classification [51.58605842457186]
We present a fine-grained disaster tweet classification model under the semi-supervised, few-shot learning setting.
Our model, CrisisMatch, effectively classifies tweets into fine-grained classes of interest using few labeled data and large amounts of unlabeled data.
arXiv Detail & Related papers (2023-10-23T07:01:09Z) - Sarcasm Detection in a Disaster Context [103.93691731605163]
We introduce HurricaneSARC, a dataset of 15,000 tweets annotated for intended sarcasm.
Our best model is able to obtain as much as 0.70 F1 on our dataset.
arXiv Detail & Related papers (2023-08-16T05:58:12Z) - IKDSumm: Incorporating Key-phrases into BERT for extractive Disaster
Tweet Summarization [5.299958874647294]
We propose a disaster-specific tweet summarization framework, IKDSumm.
IKDSumm identifies the crucial and important information from each tweet related to a disaster through key-phrases of that tweet.
We utilize these key-phrases to automatically generate a summary of the tweets.
arXiv Detail & Related papers (2023-05-19T11:05:55Z) - RweetMiner: Automatic identification and categorization of help requests
on twitter during disasters [8.288082084424863]
Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people.
Many people turn to social media during disasters for requesting help and/or providing relief to others.
Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets.
arXiv Detail & Related papers (2023-03-04T12:21:45Z) - CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and
Summarization [62.77066949111921]
This paper presents CrisisLTLSum, the largest dataset of local crisis event timelines available to date.
CrisisLTLSum contains 1,000 crisis event timelines across four domains: wildfires, local fires, traffic, and storms.
Our initial experiments indicate a significant gap between the performance of strong baselines compared to the human performance on both tasks.
arXiv Detail & Related papers (2022-10-25T17:32:40Z) - Event-Related Bias Removal for Real-time Disaster Events [67.2965372987723]
Social media has become an important tool to share information about crisis events such as natural disasters and mass attacks.
Detecting actionable posts that contain useful information requires rapid analysis of huge volume of data in real-time.
We train an adversarial neural model to remove latent event-specific biases and improve the performance on tweet importance classification.
arXiv Detail & Related papers (2020-11-02T02:03:07Z) - On Identifying Hashtags in Disaster Twitter Data [55.17975121160699]
We construct a unique dataset of disaster-related tweets annotated with hashtags useful for filtering actionable information.
Using this dataset, we investigate Long Short Term Memory-based models within a Multi-Task Learning framework.
The best performing model achieves an F1-score as high as 92.22%.
arXiv Detail & Related papers (2020-01-05T22:37:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.