Adaptive Prompt Learning with Distilled Connective Knowledge for
Implicit Discourse Relation Recognition
- URL: http://arxiv.org/abs/2309.07561v1
- Date: Thu, 14 Sep 2023 09:44:46 GMT
- Title: Adaptive Prompt Learning with Distilled Connective Knowledge for
Implicit Discourse Relation Recognition
- Authors: Bang Wang, Zhenglin Wang, Wei Xiang and Yijun Mo
- Abstract summary: Implicit discourse relation recognition (IDRR) aims at recognizing the discourse relation between two text segments without an explicit connective.
We propose a continuous version of prompt learning together with connective knowledge distillation, called AdaptPrompt, to reduce manual design efforts via continuous prompting.
We also design an answer-relation mapping rule to generate a few virtual answers as the answer space.
- Score: 18.42715011594281
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Implicit discourse relation recognition (IDRR) aims at recognizing the
discourse relation between two text segments without an explicit connective.
Recently, the prompt learning has just been applied to the IDRR task with great
performance improvements over various neural network-based approaches. However,
the discrete nature of the state-art-of-art prompting approach requires manual
design of templates and answers, a big hurdle for its practical applications.
In this paper, we propose a continuous version of prompt learning together with
connective knowledge distillation, called AdaptPrompt, to reduce manual design
efforts via continuous prompting while further improving performance via
knowledge transfer. In particular, we design and train a few virtual tokens to
form continuous templates and automatically select the most suitable one by
gradient search in the embedding space. We also design an answer-relation
mapping rule to generate a few virtual answers as the answer space.
Furthermore, we notice the importance of annotated connectives in the training
dataset and design a teacher-student architecture for knowledge transfer.
Experiments on the up-to-date PDTB Corpus V3.0 validate our design objectives
in terms of the better relation recognition performance over the
state-of-the-art competitors.
Related papers
- Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition [96.62264528407863]
We propose a self-supervised contrastive learning framework to excavate rich context via spatial-temporal consistency.
Inspired by the complementary property of motion and joint modalities, we first introduce first-order motion information into sign language modeling.
Our method is evaluated with extensive experiments on four public benchmarks, and achieves new state-of-the-art performance with a notable margin.
arXiv Detail & Related papers (2024-06-15T04:50:19Z) - Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching [53.05954114863596]
We propose a brand-new Deep Boosting Learning (DBL) algorithm for image-text matching.
An anchor branch is first trained to provide insights into the data properties.
A target branch is concurrently tasked with more adaptive margin constraints to further enlarge the relative distance between matched and unmatched samples.
arXiv Detail & Related papers (2024-04-28T08:44:28Z) - Prompt-based Logical Semantics Enhancement for Implicit Discourse
Relation Recognition [4.7938839332508945]
We propose a Prompt-based Logical Semantics Enhancement (PLSE) method for Implicit Discourse Relation Recognition (IDRR)
Our method seamlessly injects knowledge relevant to discourse relation into pre-trained language models through prompt-based connective prediction.
Experimental results on PDTB 2.0 and CoNLL16 datasets demonstrate that our method achieves outstanding and consistent performance against the current state-of-the-art models.
arXiv Detail & Related papers (2023-11-01T08:38:08Z) - Transferring Procedural Knowledge across Commonsense Tasks [17.929737518694616]
We study the ability of AI models to transfer procedural knowledge to novel narrative tasks in a transparent manner.
We design LEAP: a comprehensive framework that integrates state-of-the-art modeling architectures, training regimes, and augmentation strategies.
Our experiments with in- and out-of-domain tasks reveal insights into the interplay of different architectures, training regimes, and augmentation strategies.
arXiv Detail & Related papers (2023-04-26T23:24:50Z) - Continual Prompt Tuning for Dialog State Tracking [58.66412648276873]
A desirable dialog system should be able to continually learn new skills without forgetting old ones.
We present Continual Prompt Tuning, a parameter-efficient framework that not only avoids forgetting but also enables knowledge transfer between tasks.
arXiv Detail & Related papers (2022-03-13T13:22:41Z) - KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization
for Relation Extraction [111.74812895391672]
We propose a Knowledge-aware Prompt-tuning approach with synergistic optimization (KnowPrompt)
We inject latent knowledge contained in relation labels into prompt construction with learnable virtual type words and answer words.
arXiv Detail & Related papers (2021-04-15T17:57:43Z) - Exploiting Structured Knowledge in Text via Graph-Guided Representation
Learning [73.0598186896953]
We present two self-supervised tasks learning over raw text with the guidance from knowledge graphs.
Building upon entity-level masked language models, our first contribution is an entity masking scheme.
In contrast to existing paradigms, our approach uses knowledge graphs implicitly, only during pre-training.
arXiv Detail & Related papers (2020-04-29T14:22:42Z) - A Dependency Syntactic Knowledge Augmented Interactive Architecture for
End-to-End Aspect-based Sentiment Analysis [73.74885246830611]
We propose a novel dependency syntactic knowledge augmented interactive architecture with multi-task learning for end-to-end ABSA.
This model is capable of fully exploiting the syntactic knowledge (dependency relations and types) by leveraging a well-designed Dependency Relation Embedded Graph Convolutional Network (DreGcn)
Extensive experimental results on three benchmark datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2020-04-04T14:59:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.