A Survey of Implicit Discourse Relation Recognition
- URL: http://arxiv.org/abs/2203.02982v1
- Date: Sun, 6 Mar 2022 15:12:53 GMT
- Title: A Survey of Implicit Discourse Relation Recognition
- Authors: Wei Xiang and Bang Wang
- Abstract summary: implicit discourse relation recognition (IDRR) is to detect implicit relation and classify its sense between two text segments without a connective.
This article provides a comprehensive and up-to-date survey for the IDRR task.
- Score: 9.57170901247685
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A discourse containing one or more sentences describes daily issues and
events for people to communicate their thoughts and opinions. As sentences are
normally consist of multiple text segments, correct understanding of the theme
of a discourse should take into consideration of the relations in between text
segments. Although sometimes a connective exists in raw texts for conveying
relations, it is more often the cases that no connective exists in between two
text segments but some implicit relation does exist in between them. The task
of implicit discourse relation recognition (IDRR) is to detect implicit
relation and classify its sense between two text segments without a connective.
Indeed, the IDRR task is important to diverse downstream natural language
processing tasks, such as text summarization, machine translation and so on.
This article provides a comprehensive and up-to-date survey for the IDRR task.
We first summarize the task definition and data sources widely used in the
field. We categorize the main solution approaches for the IDRR task from the
viewpoint of its development history. In each solution category, we present and
analyze the most representative methods, including their origins, ideas,
strengths and weaknesses. We also present performance comparisons for those
solutions experimented on a public corpus with standard data processing
procedures. Finally, we discuss future research directions for discourse
relation analysis.
Related papers
- CERD: A Comprehensive Chinese Rhetoric Dataset for Rhetorical Understanding and Generation in Essays [30.728539221991188]
Existing rhetorical datasets or corpora primarily focus on single coarse-grained categories or fine-grained categories.
We propose the Chinese Essay Rhetoric dataset (CERD), consisting of 4 commonly used coarse-grained categories.
CERD is a manually annotated and comprehensive Chinese rhetoric dataset with five interrelated sub-tasks.
arXiv Detail & Related papers (2024-09-29T12:47:25Z) - Narrative Action Evaluation with Prompt-Guided Multimodal Interaction [60.281405999483]
Narrative action evaluation (NAE) aims to generate professional commentary that evaluates the execution of an action.
NAE is a more challenging task because it requires both narrative flexibility and evaluation rigor.
We propose a prompt-guided multimodal interaction framework to facilitate the interaction between different modalities of information.
arXiv Detail & Related papers (2024-04-22T17:55:07Z) - Explaining Interactions Between Text Spans [50.70253702800355]
Reasoning over spans of tokens from different parts of the input is essential for natural language understanding.
We introduce SpanEx, a dataset of human span interaction explanations for two NLU tasks: NLI and FC.
We then investigate the decision-making processes of multiple fine-tuned large language models in terms of the employed connections between spans.
arXiv Detail & Related papers (2023-10-20T13:52:37Z) - Topic Segmentation Model Focusing on Local Context [1.9871897882042773]
We propose siamese sentence embedding layers which process two input sentences independently to get appropriate amount of information.
Also, we adopt multi-task learning techniques including Same Topic Prediction (STP), Topic Classification (TC) and Next Sentence Prediction (NSP)
arXiv Detail & Related papers (2023-01-05T06:57:42Z) - PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and
Entailment Recognition [63.51569687229681]
We argue for the need to recognize the textual entailment relation of each proposition in a sentence individually.
We propose PropSegmEnt, a corpus of over 45K propositions annotated by expert human raters.
Our dataset structure resembles the tasks of (1) segmenting sentences within a document to the set of propositions, and (2) classifying the entailment relation of each proposition with respect to a different yet topically-aligned document.
arXiv Detail & Related papers (2022-12-21T04:03:33Z) - Relational Sentence Embedding for Flexible Semantic Matching [86.21393054423355]
We present Sentence Embedding (RSE), a new paradigm to discover further the potential of sentence embeddings.
RSE is effective and flexible in modeling sentence relations and outperforms a series of state-of-the-art embedding methods.
arXiv Detail & Related papers (2022-12-17T05:25:17Z) - Dense Paraphrasing for Textual Enrichment [7.6233489924270765]
We define the process of rewriting a textual expression (lexeme or phrase) such that it reduces ambiguity while also making explicit the underlying semantics that is not (necessarily) expressed in the economy of sentence structure as Dense Paraphrasing (DP)
We build the first complete DP dataset, provide the scope and design of the annotation task, and present results demonstrating how this DP process can enrich a source text to improve inferencing and QA task performance.
arXiv Detail & Related papers (2022-10-20T19:58:31Z) - Improving Multi-task Generalization Ability for Neural Text Matching via
Prompt Learning [54.66399120084227]
Recent state-of-the-art neural text matching models (PLMs) are hard to generalize to different tasks.
We adopt a specialization-generalization training strategy and refer to it as Match-Prompt.
In specialization stage, descriptions of different matching tasks are mapped to only a few prompt tokens.
In generalization stage, text matching model explores the essential matching signals by being trained on diverse multiple matching tasks.
arXiv Detail & Related papers (2022-04-06T11:01:08Z) - Pairwise Representation Learning for Event Coreference [73.10563168692667]
We develop a Pairwise Representation Learning (PairwiseRL) scheme for the event mention pairs.
Our representation supports a finer, structured representation of the text snippet to facilitate encoding events and their arguments.
We show that PairwiseRL, despite its simplicity, outperforms the prior state-of-the-art event coreference systems on both cross-document and within-document event coreference benchmarks.
arXiv Detail & Related papers (2020-10-24T06:55:52Z) - Understanding Points of Correspondence between Sentences for Abstractive
Summarization [39.7404761923196]
We present an investigation into fusing sentences drawn from a document by introducing the notion of points of correspondence.
We create a dataset containing the documents, source and fusion sentences, and human annotations of points of correspondence between sentences.
arXiv Detail & Related papers (2020-06-10T02:42:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.