BoschAI @ Causal News Corpus 2023: Robust Cause-Effect Span Extraction
using Multi-Layer Sequence Tagging and Data Augmentation
- URL: http://arxiv.org/abs/2312.06338v1
- Date: Mon, 11 Dec 2023 12:35:35 GMT
- Title: BoschAI @ Causal News Corpus 2023: Robust Cause-Effect Span Extraction
using Multi-Layer Sequence Tagging and Data Augmentation
- Authors: Timo Pierre Schrader, Simon Razniewski, Lukas Lange, Annemarie
Friedrich
- Abstract summary: Event Causality Identification with Causal News Corpus Shared Task addresses two aspects of this challenge.
Subtask 1 aims at detecting causal relationships in texts, and Subtask 2 requires identifying signal words and the spans that refer to the cause or effect.
Our system, which is based on pre-trained transformers, stacked sequence tagging, and synthetic data augmentation, ranks third in Subtask 1 and wins Subtask 2 with an F1 score of 72.8.
- Score: 16.59785586761074
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Understanding causality is a core aspect of intelligence. The Event Causality
Identification with Causal News Corpus Shared Task addresses two aspects of
this challenge: Subtask 1 aims at detecting causal relationships in texts, and
Subtask 2 requires identifying signal words and the spans that refer to the
cause or effect, respectively. Our system, which is based on pre-trained
transformers, stacked sequence tagging, and synthetic data augmentation, ranks
third in Subtask 1 and wins Subtask 2 with an F1 score of 72.8, corresponding
to a margin of 13 pp. to the second-best system.
Related papers
- Iteratively Improving Biomedical Entity Linking and Event Extraction via
Hard Expectation-Maximization [9.422435686239538]
Biomedical entity linking and event extraction are two crucial tasks to support text understanding and retrieval in the biomedical domain.
Previous research typically solves these two tasks separately or in a pipeline, leading to error propagation.
We propose joint biomedical entity linking and event extraction by regarding the event structures and entity references in knowledge bases as latent variables.
arXiv Detail & Related papers (2023-05-24T02:30:31Z) - Event Causality Identification with Causal News Corpus -- Shared Task 3,
CASE 2022 [3.0775142635531685]
Event Causality Identification Shared Task of CASE 2022 involved two subtasks.
Subtask 1 required participants to predict if a sentence contains a causal relation or not.
Subtask 2 required participants to identify the Cause, Effect and Signal spans per causal sentence.
arXiv Detail & Related papers (2022-11-22T10:34:09Z) - UniCausal: Unified Benchmark and Repository for Causal Text Mining [7.402967063220846]
We propose UniCausal, a unified benchmark for causal text mining across three tasks.
We consolidate and align annotations of six high quality, mainly human-annotated, corpora.
To create an initial benchmark, we fine-tuned BERT pre-trained language models to each task, achieving 70.10% Binary F1, 52.42% Macro F1, and 84.68% Binary F1 scores respectively.
arXiv Detail & Related papers (2022-08-19T06:14:05Z) - Unifying Event Detection and Captioning as Sequence Generation via
Pre-Training [53.613265415703815]
We propose a unified pre-training and fine-tuning framework to enhance the inter-task association between event detection and captioning.
Our model outperforms the state-of-the-art methods, and can be further boosted when pre-trained on extra large-scale video-text data.
arXiv Detail & Related papers (2022-07-18T14:18:13Z) - Fast Inference and Transfer of Compositional Task Structures for
Few-shot Task Generalization [101.72755769194677]
We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph.
Our multi-task subtask graph inferencer (MTSGI) first infers the common high-level task structure in terms of the subtask graph from the training tasks.
Our experiment results on 2D grid-world and complex web navigation domains show that the proposed method can learn and leverage the common underlying structure of the tasks for faster adaptation to the unseen tasks.
arXiv Detail & Related papers (2022-05-25T10:44:25Z) - Learning Constraints and Descriptive Segmentation for Subevent Detection [74.48201657623218]
We propose an approach to learning and enforcing constraints that capture dependencies between subevent detection and EventSeg prediction.
We adopt Rectifier Networks for constraint learning and then convert the learned constraints to a regularization term in the loss function of the neural model.
arXiv Detail & Related papers (2021-09-13T20:50:37Z) - Cross-Domain Contract Element Extraction with a Bi-directional Feedback
Clause-Element Relation Network [70.00960496773938]
Bi-directional Feedback cLause-Element relaTion network (Bi-FLEET) is proposed for the cross-domain contract element extraction task.
Bi-FLEET has three main components: (1) a context encoder, (2) a clause-element relation encoder, and (3) an inference layer.
The experimental results over both cross-domain NER and CEE tasks show that Bi-FLEET significantly outperforms state-of-the-art baselines.
arXiv Detail & Related papers (2021-05-13T05:14:36Z) - On Improving Deep Learning Trace Analysis with System Call Arguments [1.3299507495084417]
Kernel traces are sequences of low-level events comprising a name and multiple arguments.
We introduce a general approach to learning a representation of the event names along with their arguments using both embedding and encoding.
arXiv Detail & Related papers (2021-03-11T19:26:34Z) - A Co-Interactive Transformer for Joint Slot Filling and Intent Detection [61.109486326954205]
Intent detection and slot filling are two main tasks for building a spoken language understanding (SLU) system.
Previous studies either model the two tasks separately or only consider the single information flow from intent to slot.
We propose a Co-Interactive Transformer to consider the cross-impact between the two tasks simultaneously.
arXiv Detail & Related papers (2020-10-08T10:16:52Z) - An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment
Analysis [73.7488524683061]
We propose a novel Iterative Multi-Knowledge Transfer Network (IMKTN) for end-to-end ABSA.
Our IMKTN transfers the task-specific knowledge from any two of the three subtasks to another one at the token level by utilizing a well-designed routing algorithm.
Experimental results on three benchmark datasets demonstrate the effectiveness and superiority of our approach.
arXiv Detail & Related papers (2020-04-04T13:49:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.