Dependency Parsing as MRC-based Span-Span Prediction
- URL: http://arxiv.org/abs/2105.07654v1
- Date: Mon, 17 May 2021 08:03:48 GMT
- Title: Dependency Parsing as MRC-based Span-Span Prediction
- Authors: Leilei Gan, Yuxian Meng, Kun Kuang, Xiaofei Sun, Chun Fan, Fei Wu and
Jiwei Li
- Abstract summary: Higher-order methods for dependency parsing can partially but not fully address the issue that edges in dependency tree should be constructed at the text span/subtree level rather than word level.
We propose a new method for dependency parsing to address this issue.
- Score: 29.956515394820673
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Higher-order methods for dependency parsing can partially but not fully
addresses the issue that edges in dependency tree should be constructed at the
text span/subtree level rather than word level. % This shortcoming can cause an
incorrect span covered the corresponding tree rooted at a certain word though
the word is correctly linked to its head. In this paper, we propose a new
method for dependency parsing to address this issue. The proposed method
constructs dependency trees by directly modeling span-span (in other words,
subtree-subtree) relations. It consists of two modules: the {\it text span
proposal module} which proposes candidate text spans, each of which represents
a subtree in the dependency tree denoted by (root, start, end); and the {\it
span linking module}, which constructs links between proposed spans. We use the
machine reading comprehension (MRC) framework as the backbone to formalize the
span linking module in an MRC setup, where one span is used as a query to
extract the text span/subtree it should be linked to. The proposed method comes
with the following merits: (1) it addresses the fundamental problem that edges
in a dependency tree should be constructed between subtrees; (2) the MRC
framework allows the method to retrieve missing spans in the span proposal
stage, which leads to higher recall for eligible spans. Extensive experiments
on the PTB, CTB and Universal Dependencies (UD) benchmarks demonstrate the
effectiveness of the proposed method. We are able to achieve new SOTA
performances on PTB and UD benchmarks, and competitive performances to previous
SOTA models on the CTB dataset. Code is available at
https://github.com/ShannonAI/mrc-for-dependency-parsing.
Related papers
- Structured Dialogue Discourse Parsing [79.37200787463917]
discourse parsing aims to uncover the internal structure of a multi-participant conversation.
We propose a principled method that improves upon previous work from two perspectives: encoding and decoding.
Experiments show that our method achieves new state-of-the-art, surpassing the previous model by 2.3 on STAC and 1.5 on Molweni.
arXiv Detail & Related papers (2023-06-26T22:51:01Z) - Incorporating Constituent Syntax for Coreference Resolution [50.71868417008133]
We propose a graph-based method to incorporate constituent syntactic structures.
We also explore to utilise higher-order neighbourhood information to encode rich structures in constituent trees.
Experiments on the English and Chinese portions of OntoNotes 5.0 benchmark show that our proposed model either beats a strong baseline or achieves new state-of-the-art performance.
arXiv Detail & Related papers (2022-02-22T07:40:42Z) - Combining (second-order) graph-based and headed span-based projective
dependency parsing [24.337440797369702]
citetyang2021headed propose a headed span-based method. Both of them score all possible trees and globally find the highest-scoring tree.
In this paper, we combine these two kinds of methods, designing several dynamic programming algorithms for joint inference.
arXiv Detail & Related papers (2021-08-12T16:42:00Z) - Headed Span-Based Projective Dependency Parsing [24.337440797369702]
We propose a headed span-based method for projective dependency parsing.
We use neural networks to score headed spans and design a novel $O(n3)$ dynamic programming algorithm to enable global training and exact inference.
arXiv Detail & Related papers (2021-08-10T15:27:47Z) - RST Parsing from Scratch [14.548146390081778]
We introduce a novel end-to-end formulation of document-level discourse parsing in the Rhetorical Structure Theory (RST) framework.
Our framework facilitates discourse parsing from scratch without requiring discourse segmentation as a prerequisite.
Our unified parsing model adopts a beam search to decode the best tree structure by searching through a space of high-scoring trees.
arXiv Detail & Related papers (2021-05-23T06:19:38Z) - Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic
Parsing [110.97778888305506]
BRIDGE represents the question and DB schema in a tagged sequence where a subset of the fields are augmented with cell values mentioned in the question.
BRIDGE attained state-of-the-art performance on popular cross-DB text-to- relational benchmarks.
Our analysis shows that BRIDGE effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks.
arXiv Detail & Related papers (2020-12-23T12:33:52Z) - Please Mind the Root: Decoding Arborescences for Dependency Parsing [67.71280539312536]
We analyze the output of state-of-the-arts on many languages from the Universal Dependency Treebank.
The worst constraint-violation rate we observe is 24%.
arXiv Detail & Related papers (2020-10-06T08:31:14Z) - Span-based Semantic Parsing for Compositional Generalization [53.24255235340056]
SpanBasedSP predicts a span tree over an input utterance, explicitly encoding how partial programs compose over spans in the input.
On GeoQuery, SCAN and CLOSURE, SpanBasedSP performs similarly to strong seq2seq baselines on random splits, but dramatically improves performance compared to baselines on splits that require compositional generalization.
arXiv Detail & Related papers (2020-09-13T16:42:18Z) - Graph Structured Network for Image-Text Matching [127.68148793548116]
We present a novel Graph Structured Matching Network to learn fine-grained correspondence.
The GSMN explicitly models object, relation and attribute as a structured phrase.
Experiments show that GSMN outperforms state-of-the-art methods on benchmarks.
arXiv Detail & Related papers (2020-04-01T08:20:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.