Few-Shot Event Detection with Prototypical Amortized Conditional Random
Field
- URL: http://arxiv.org/abs/2012.02353v1
- Date: Fri, 4 Dec 2020 01:11:13 GMT
- Title: Few-Shot Event Detection with Prototypical Amortized Conditional Random
Field
- Authors: Xin Cong, Shiyao Cui, Bowen Yu, Tingwen Liu, Yubin Wang, Bin Wang
- Abstract summary: Event Detection tends to struggle when it needs to recognize novel event types with a few samples.
We present a novel unified joint model which converts the task to a few-shot tagging problem with a double-part tagging scheme.
We conduct experiments on the benchmark dataset FewEvent and the experimental results show that the tagging based methods are better than existing pipeline and joint learning methods.
- Score: 8.782210889586837
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Event Detection, a fundamental task of Information Extraction, tends to
struggle when it needs to recognize novel event types with a few samples, i.e.
Few-Shot Event Detection (FSED). Previous identify-then-classify paradigm
attempts to solve this problem in the pipeline manner but ignores the trigger
discrepancy between event types, thus suffering from the error propagation. In
this paper, we present a novel unified joint model which converts the task to a
few-shot tagging problem with a double-part tagging scheme. To this end, we
first design the Prototypical Amortized Conditional Random Field (PA-CRF) to
model the label dependency in the few-shot scenario, which builds prototypical
amortization networks to approximate the transition scores between labels based
on the label prototypes. Then Gaussian distribution is introduced for the
modeling of the transition scores in PA-CRF to alleviate the uncertain
estimation resulting from insufficient data. We conduct experiments on the
benchmark dataset FewEvent and the experimental results show that the tagging
based methods are better than existing pipeline and joint learning methods. In
addition, the proposed PA-CRF achieves the best results on the public dataset.
Related papers
- Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection [19.946344683965425]
We propose a novel methodology to address the challenge of FSAD.
We employ a model pre-trained on a large source dataset to model weights.
We evaluate few-shot anomaly detection on on 3 controlled AD tasks and 4 real-world AD tasks to demonstrate the effectiveness of the proposed method.
arXiv Detail & Related papers (2024-02-29T09:48:19Z) - Decoupled Prototype Learning for Reliable Test-Time Adaptation [50.779896759106784]
Test-time adaptation (TTA) is a task that continually adapts a pre-trained source model to the target domain during inference.
One popular approach involves fine-tuning model with cross-entropy loss according to estimated pseudo-labels.
This study reveals that minimizing the classification error of each sample causes the cross-entropy loss's vulnerability to label noise.
We propose a novel Decoupled Prototype Learning (DPL) method that features prototype-centric loss computation.
arXiv Detail & Related papers (2024-01-15T03:33:39Z) - Semi-Supervised Temporal Action Detection with Proposal-Free Masking [134.26292288193298]
We propose a novel Semi-supervised Temporal action detection model based on PropOsal-free Temporal mask (SPOT)
SPOT outperforms state-of-the-art alternatives, often by a large margin.
arXiv Detail & Related papers (2022-07-14T16:58:47Z) - Improve Event Extraction via Self-Training with Gradient Guidance [10.618929821822892]
We propose a Self-Training with Feedback (STF) framework to overcome the main factor that hinders the progress of event extraction.
STF consists of (1) a base event extraction model trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions as pseudo training samples, and (2) a novel scoring model that takes in each new predicted event trigger, an argument, its argument role, as well as their paths in the AMR graph to estimate a compatibility score.
Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+, and ERE
arXiv Detail & Related papers (2022-05-25T04:40:17Z) - Efficient Test-Time Model Adaptation without Forgetting [60.36499845014649]
Test-time adaptation seeks to tackle potential distribution shifts between training and testing data.
We propose an active sample selection criterion to identify reliable and non-redundant samples.
We also introduce a Fisher regularizer to constrain important model parameters from drastic changes.
arXiv Detail & Related papers (2022-04-06T06:39:40Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - Event Data Association via Robust Model Fitting for Event-based Object Tracking [66.05728523166755]
We propose a novel Event Data Association (called EDA) approach to explicitly address the event association and fusion problem.
The proposed EDA seeks for event trajectories that best fit the event data, in order to perform unifying data association and information fusion.
The experimental results show the effectiveness of EDA under challenging scenarios, such as high speed, motion blur, and high dynamic range conditions.
arXiv Detail & Related papers (2021-10-25T13:56:00Z) - Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot
Event Classification [24.598938900747186]
Few-Shot Event Classification (FSEC) aims at developing a model for event prediction, which can generalize to new event types with a limited number of annotated data.
We find existing FSEC models suffer from trigger biases that signify the statistical homogeneity between some trigger words and target event types.
To cope with the context-bypassing problem in FSEC models, we introduce adversarial training and trigger reconstruction techniques.
arXiv Detail & Related papers (2021-08-29T13:46:42Z) - Detecting Anomalous Event Sequences with Temporal Point Processes [28.997992932163008]
We frame the problem of detecting anomalous continuous-time event sequences as out-of-distribution (OoD) detection for temporal point processes (TPPs)
First, we show how this problem can be approached using goodness-of-fit (GoF) tests.
We then demonstrate the limitations of popular GoF statistics for TPPs and propose a new test that addresses these shortcomings.
arXiv Detail & Related papers (2021-06-08T15:50:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.