FCDS: Fusing Constituency and Dependency Syntax into Document-Level
Relation Extraction
- URL: http://arxiv.org/abs/2403.01886v1
- Date: Mon, 4 Mar 2024 09:48:55 GMT
- Title: FCDS: Fusing Constituency and Dependency Syntax into Document-Level
Relation Extraction
- Authors: Xudong Zhu, Zhao Kang, Bei Hui
- Abstract summary: Document-level Relation Extraction (DocRE) aims to identify relation labels between entities within a single document.
We propose to fuse constituency and dependency syntax into DocRE.
The experimental results on datasets from various domains demonstrate the effectiveness of the proposed method.
- Score: 6.293453766383407
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Document-level Relation Extraction (DocRE) aims to identify relation labels
between entities within a single document. It requires handling several
sentences and reasoning over them. State-of-the-art DocRE methods use a graph
structure to connect entities across the document to capture dependency syntax
information. However, this is insufficient to fully exploit the rich syntax
information in the document. In this work, we propose to fuse constituency and
dependency syntax into DocRE. It uses constituency syntax to aggregate the
whole sentence information and select the instructive sentences for the pairs
of targets. It exploits the dependency syntax in a graph structure with
constituency syntax enhancement and chooses the path between entity pairs based
on the dependency graph. The experimental results on datasets from various
domains demonstrate the effectiveness of the proposed method. The code is
publicly available at this url.
Related papers
- Data-driven Coreference-based Ontology Building [48.995395445597225]
Coreference resolution is traditionally used as a component in individual document understanding.
We take a more global view and explore what can we learn about a domain from the set of all document-level coreference relations.
We release the coreference chains resulting under a creative-commons license, along with the code.
arXiv Detail & Related papers (2024-10-22T14:30:40Z) - Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval [49.42043077545341]
We propose a knowledge-aware query expansion framework, augmenting LLMs with structured document relations from knowledge graph (KG)
We leverage document texts as rich KG node representations and use document-based relation filtering for our Knowledge-Aware Retrieval (KAR)
arXiv Detail & Related papers (2024-10-17T17:03:23Z) - Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification [20.434941308959786]
Long document classification presents challenges due to their extensive content and complex structure.
Existing methods often struggle with token limits and fail to adequately model hierarchical relationships within documents.
Our approach integrates syntax trees for sentence encodings and document graphs for document encodings, which capture fine-grained syntactic relationships and broader document contexts.
arXiv Detail & Related papers (2024-10-03T19:25:01Z) - Contextual Document Embeddings [77.22328616983417]
We propose two complementary methods for contextualized document embeddings.
First, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss.
Second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation.
arXiv Detail & Related papers (2024-10-03T14:33:34Z) - A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction [12.286432133599355]
Document-level Event Argument Extraction (DEAE) aims to identify arguments and their specific roles from an unstructured document.
advanced approaches on DEAE utilize prompt-based methods to guide pre-trained language models (PLMs) in extracting arguments from input documents.
We propose a semantic mention Graph Augmented Model (GAM) to address these two problems in this paper.
arXiv Detail & Related papers (2024-03-12T08:58:07Z) - Anaphor Assisted Document-Level Relation Extraction [42.57958231709678]
Document-level relation extraction (DocRE) involves identifying relations between entities distributed in multiple sentences within a document.
Existing methods focus on building a heterogeneous document graph to model the internal structure of an entity and the external interaction between entities.
We propose an Anaphor-Assisted (AA) framework for DocRE tasks.
arXiv Detail & Related papers (2023-10-28T06:11:18Z) - DocTr: Document Transformer for Structured Information Extraction in
Documents [36.1145541816468]
We present a new formulation for structured information extraction from visually rich documents.
It aims to address the limitations of existing IOB tagging or graph-based formulations.
We represent an entity as an anchor word and a bounding box, and represent entity linking as the association between anchor words.
arXiv Detail & Related papers (2023-07-16T02:59:30Z) - Not Just Plain Text! Fuel Document-Level Relation Extraction with
Explicit Syntax Refinement and Subsentence Modeling [3.9436257406798925]
We propose expLicit syntAx Refinement and Subsentence mOdeliNg based framework (LARSON)
By introducing extra syntactic information, LARSON can model subsentences of arbitrary granularity and efficiently screen instructive ones.
Experimental results on three benchmark datasets (DocRED, CDR, and GDA) demonstrate that LARSON significantly outperforms existing methods.
arXiv Detail & Related papers (2022-11-10T05:06:37Z) - Document-Level Relation Extraction with Sentences Importance Estimation
and Focusing [52.069206266557266]
Document-level relation extraction (DocRE) aims to determine the relation between two entities from a document of multiple sentences.
We propose a Sentence Estimation and Focusing (SIEF) framework for DocRE, where we design a sentence importance score and a sentence focusing loss.
Experimental results on two domains show that our SIEF not only improves overall performance, but also makes DocRE models more robust.
arXiv Detail & Related papers (2022-04-27T03:20:07Z) - DART: Open-Domain Structured Data Record to Text Generation [91.23798751437835]
We present DART, an open domain structured DAta Record to Text generation dataset with over 82k instances (DARTs)
We propose a procedure of extracting semantic triples from tables that encode their structures by exploiting the semantic dependencies among table headers and the table title.
Our dataset construction framework effectively merged heterogeneous sources from open domain semantic parsing and dialogue-act-based meaning representation tasks.
arXiv Detail & Related papers (2020-07-06T16:35:30Z) - Relational Message Passing for Knowledge Graph Completion [78.47976646383222]
We propose a relational message passing method for knowledge graph completion.
It passes relational messages among edges iteratively to aggregate neighborhood information.
Results show our method outperforms stateof-the-art knowledge completion methods by a large margin.
arXiv Detail & Related papers (2020-02-17T03:33:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.