A Top-Down Neural Architecture towards Text-Level Parsing of Discourse
Rhetorical Structure
- URL: http://arxiv.org/abs/2005.02680v4
- Date: Wed, 19 May 2021 11:35:10 GMT
- Title: A Top-Down Neural Architecture towards Text-Level Parsing of Discourse
Rhetorical Structure
- Authors: Longyin Zhang, Yuqing Xing, Fang Kong, Peifeng Li, Guodong Zhou
- Abstract summary: We propose a top-down neural architecture toward text-level DRS parsing.
We cast discourse parsing as a split point ranking task, where a split point is classified to different levels according to its rank.
In this way, we can determine the complete DRS as a hierarchical tree structure via an encoder-decoder with an internal stack.
- Score: 27.927104697483934
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Due to its great importance in deep natural language understanding and
various down-stream applications, text-level parsing of discourse rhetorical
structure (DRS) has been drawing more and more attention in recent years.
However, all the previous studies on text-level discourse parsing adopt
bottom-up approaches, which much limit the DRS determination on local
information and fail to well benefit from global information of the overall
discourse. In this paper, we justify from both computational and perceptive
points-of-view that the top-down architecture is more suitable for text-level
DRS parsing. On the basis, we propose a top-down neural architecture toward
text-level DRS parsing. In particular, we cast discourse parsing as a recursive
split point ranking task, where a split point is classified to different levels
according to its rank and the elementary discourse units (EDUs) associated with
it are arranged accordingly. In this way, we can determine the complete DRS as
a hierarchical tree structure via an encoder-decoder with an internal stack.
Experimentation on both the English RST-DT corpus and the Chinese CDTB corpus
shows the great effectiveness of our proposed top-down approach towards
text-level DRS parsing.
Related papers
- RST-style Discourse Parsing Guided by Document-level Content Structures [27.28989421841165]
Existing RST parsing pipelines construct rhetorical structures without the knowledge of document-level content structures.
We propose a novel pipeline for RST-DP that incorporates structure-aware news content sentence representations.
arXiv Detail & Related papers (2023-09-08T05:50:27Z) - Revisiting Conversation Discourse for Dialogue Disentanglement [88.3386821205896]
We propose enhancing dialogue disentanglement by taking full advantage of the dialogue discourse characteristics.
We develop a structure-aware framework to integrate the rich structural features for better modeling the conversational semantic context.
Our work has great potential to facilitate broader multi-party multi-thread dialogue applications.
arXiv Detail & Related papers (2023-06-06T19:17:47Z) - TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision [61.186488081379]
We propose TextFormer, a query-based end-to-end text spotter with Transformer architecture.
TextFormer builds upon an image encoder and a text decoder to learn a joint semantic understanding for multi-task modeling.
It allows for mutual training and optimization of classification, segmentation, and recognition branches, resulting in deeper feature sharing.
arXiv Detail & Related papers (2023-06-06T03:37:41Z) - Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark [44.06803331843307]
paragraph-level topic structure can grasp and understand the overall context of a document from a higher level.
The lack of large-scale, high-quality Chinese paragraph-level topic structure corpora restrained research and applications.
We propose a hierarchical paragraph-level topic structure representation with three layers to guide the corpus construction.
We employ a two-stage man-machine collaborative annotation method to construct the largest Chinese paragraph-level Topic Structure corpus.
arXiv Detail & Related papers (2023-05-24T06:43:23Z) - Topic-driven Distant Supervision Framework for Macro-level Discourse
Parsing [72.14449502499535]
The task of analyzing the internal rhetorical structure of texts is a challenging problem in natural language processing.
Despite the recent advances in neural models, the lack of large-scale, high-quality corpora for training remains a major obstacle.
Recent studies have attempted to overcome this limitation by using distant supervision.
arXiv Detail & Related papers (2023-05-23T07:13:51Z) - Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue:
An Empirical Study [51.079100495163736]
This paper systematically inspects ChatGPT's performance in two discourse analysis tasks: topic segmentation and discourse parsing.
ChatGPT demonstrates proficiency in identifying topic structures in general-domain conversations yet struggles considerably in specific-domain conversations.
Our deeper investigation indicates that ChatGPT can give more reasonable topic structures than human annotations but only linearly parses the hierarchical rhetorical structures.
arXiv Detail & Related papers (2023-05-15T07:14:41Z) - Global and Local Hierarchy-aware Contrastive Framework for Implicit
Discourse Relation Recognition [8.143877598684528]
implicit discourse relation recognition (IDRR) is a challenging task in discourse analysis.
Recent methods tend to integrate the whole hierarchical information of senses into discourse relation representations.
We propose a novel GlObal and Local Hierarchy-aware Contrastive Framework (GOLF), to model two kinds of hierarchies.
arXiv Detail & Related papers (2022-11-25T03:19:03Z) - RST Parsing from Scratch [14.548146390081778]
We introduce a novel end-to-end formulation of document-level discourse parsing in the Rhetorical Structure Theory (RST) framework.
Our framework facilitates discourse parsing from scratch without requiring discourse segmentation as a prerequisite.
Our unified parsing model adopts a beam search to decode the best tree structure by searching through a space of high-scoring trees.
arXiv Detail & Related papers (2021-05-23T06:19:38Z) - An End-to-End Document-Level Neural Discourse Parser Exploiting
Multi-Granularity Representations [24.986030179701405]
We exploit robust representations derived from multiple levels of granularity across syntax and semantics.
We incorporate such representations in an end-to-end encoder-decoder neural architecture for more resourceful discourse processing.
arXiv Detail & Related papers (2020-12-21T08:01:04Z) - Hierarchical Bi-Directional Self-Attention Networks for Paper Review
Rating Recommendation [81.55533657694016]
We propose a Hierarchical bi-directional self-attention Network framework (HabNet) for paper review rating prediction and recommendation.
Specifically, we leverage the hierarchical structure of the paper reviews with three levels of encoders: sentence encoder (level one), intra-review encoder (level two) and inter-review encoder (level three)
We are able to identify useful predictors to make the final acceptance decision, as well as to help discover the inconsistency between numerical review ratings and text sentiment conveyed by reviewers.
arXiv Detail & Related papers (2020-11-02T08:07:50Z) - Abstractive Summarization of Spoken and Written Instructions with BERT [66.14755043607776]
We present the first application of the BERTSum model to conversational language.
We generate abstractive summaries of narrated instructional videos across a wide variety of topics.
We envision this integrated as a feature in intelligent virtual assistants, enabling them to summarize both written and spoken instructional content upon request.
arXiv Detail & Related papers (2020-08-21T20:59:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.