Multi-task transfer learning for finding actionable information from
crisis-related messages on social media
- URL: http://arxiv.org/abs/2102.13395v1
- Date: Fri, 26 Feb 2021 11:11:33 GMT
- Title: Multi-task transfer learning for finding actionable information from
crisis-related messages on social media
- Authors: Congcong Wang, David Lillis
- Abstract summary: The Incident streams (IS) track is a research challenge aimed at finding important information from social media during crises for emergency response purposes.
Given a stream of crisis-related tweets, the IS challenge asks a participating system to classify what the types of users' concerns or needs are expressed in each tweet.
We describe our multi-task transfer learning approach for this challenge.
- Score: 3.4392739159262145
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The Incident streams (IS) track is a research challenge aimed at finding
important information from social media during crises for emergency response
purposes. More specifically, given a stream of crisis-related tweets, the IS
challenge asks a participating system to 1) classify what the types of users'
concerns or needs are expressed in each tweet, known as the information type
(IT) classification task and 2) estimate how critical each tweet is with regard
to emergency response, known as the priority level prediction task. In this
paper, we describe our multi-task transfer learning approach for this
challenge. Our approach leverages state-of-the-art transformer models including
both encoder-based models such as BERT and a sequence-to-sequence based T5 for
joint transfer learning on the two tasks. Based on this approach, we submitted
several runs to the track. The returned evaluation results show that our runs
substantially outperform other participating runs in both IT classification and
priority level prediction.
Related papers
- Double Mixture: Towards Continual Event Detection from Speech [60.33088725100812]
Speech event detection is crucial for multimedia retrieval, involving the tagging of both semantic and acoustic events.
This paper tackles two primary challenges in speech event detection: the continual integration of new events without forgetting previous ones, and the disentanglement of semantic from acoustic events.
We propose a novel method, 'Double Mixture,' which merges speech expertise with robust memory mechanisms to enhance adaptability and prevent forgetting.
arXiv Detail & Related papers (2024-04-20T06:32:00Z) - Multi-Query Focused Disaster Summarization via Instruction-Based
Prompting [3.6199702611839792]
CrisisFACTS aims to advance disaster summarization based on multi-stream fact-finding.
Here, participants are asked to develop systems that can extract key facts from several disaster-related events.
This paper describes our method to tackle this challenging task.
arXiv Detail & Related papers (2024-02-14T08:22:58Z) - Data-CUBE: Data Curriculum for Instruction-based Sentence Representation
Learning [85.66907881270785]
We propose a data curriculum method, namely Data-CUBE, that arranges the orders of all the multi-task data for training.
In the task level, we aim to find the optimal task order to minimize the total cross-task interference risk.
In the instance level, we measure the difficulty of all instances per task, then divide them into the easy-to-difficult mini-batches for training.
arXiv Detail & Related papers (2024-01-07T18:12:20Z) - Zero-shot Item-based Recommendation via Multi-task Product Knowledge
Graph Pre-Training [106.85813323510783]
This paper presents a novel paradigm for the Zero-Shot Item-based Recommendation (ZSIR) task.
It pre-trains a model on product knowledge graph (PKG) to refine the item features from PLMs.
We identify three challenges for pre-training PKG, which are multi-type relations in PKG, semantic divergence between item generic information and relations and domain discrepancy from PKG to downstream ZSIR task.
arXiv Detail & Related papers (2023-05-12T17:38:24Z) - Enhancing Crisis-Related Tweet Classification with Entity-Masked
Language Modeling and Multi-Task Learning [0.30458514384586394]
We propose a combination of entity-masked language modeling and hierarchical multi-label classification as a multi-task learning problem.
We evaluate our method on tweets from the TREC-IS dataset and show an absolute performance gain w.r.t. F1-score of up to 10% for actionable information types.
arXiv Detail & Related papers (2022-11-21T13:54:10Z) - Multimedia Generative Script Learning for Task Planning [58.73725388387305]
We propose a new task, Multimedia Generative Script Learning, to generate subsequent steps by tracking historical states in both text and vision modalities.
This task is challenging in three aspects: the multimedia challenge of capturing the visual states in images, the induction challenge of performing unseen tasks, and the diversity challenge of covering different information in individual steps.
Experiment results demonstrate that our approach significantly outperforms strong baselines.
arXiv Detail & Related papers (2022-08-25T19:04:28Z) - Fast Inference and Transfer of Compositional Task Structures for
Few-shot Task Generalization [101.72755769194677]
We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph.
Our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks.
Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks.
arXiv Detail & Related papers (2022-05-25T10:44:25Z) - UCD-CS at TREC 2021 Incident Streams Track [3.4392739159262145]
The TREC Incident Streams (IS) track is a research challenge organised for this purpose.
The track asks participating systems to both classify a stream of crisis-related tweets into humanitarian aid related information types.
We report on the participation of the University College Dublin School of Computer Science (UCD-CS) in TREC-IS 2021.
arXiv Detail & Related papers (2021-12-07T14:47:27Z) - Variational Multi-Task Learning with Gumbel-Softmax Priors [105.22406384964144]
Multi-task learning aims to explore task relatedness to improve individual tasks.
We propose variational multi-task learning (VMTL), a general probabilistic inference framework for learning multiple related tasks.
arXiv Detail & Related papers (2021-11-09T18:49:45Z) - Transformer-based Multi-task Learning for Disaster Tweet Categorisation [2.9112649816695204]
Social media has enabled people to circulate information in a timely fashion, thus motivating people to post messages seeking help during crisis situations.
These messages can contribute to the situational awareness of emergency responders, who have a need for them to be categorised according to information types.
We introduce a transformer-based multi-task learning (MTL) technique for classifying information types and estimating the priority of these messages.
arXiv Detail & Related papers (2021-10-15T11:13:46Z) - Unsupervised and Interpretable Domain Adaptation to Rapidly Filter
Tweets for Emergency Services [18.57009530004948]
We present a novel method to classify relevant tweets during an ongoing crisis using the publicly available dataset of TREC incident streams.
We use dedicated attention layers for each task to provide model interpretability; critical for real-word applications.
We show a practical implication of our work by providing a use-case for the COVID-19 pandemic.
arXiv Detail & Related papers (2020-03-04T06:40:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.