Entity Relation Extraction as Dependency Parsing in Visually Rich
Documents
- URL: http://arxiv.org/abs/2110.09915v1
- Date: Tue, 19 Oct 2021 12:26:40 GMT
- Title: Entity Relation Extraction as Dependency Parsing in Visually Rich
Documents
- Authors: Yue Zhang, Bo Zhang, Rui Wang, Junjie Cao, Chen Li, Zuyi Bao
- Abstract summary: We adapt the popular dependency parsing model, the biaffine, to this entity relation extraction task.
Being different from the original dependency parsing model which recognizes dependency relations between words, we identify relations between groups of words with layout information instead.
As for the real-world application, our model has been applied to the in-house customs data, achieving reliable performance in the production setting.
- Score: 18.67730663266417
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Previous works on key information extraction from visually rich documents
(VRDs) mainly focus on labeling the text within each bounding box (i.e.,
semantic entity), while the relations in-between are largely unexplored. In
this paper, we adapt the popular dependency parsing model, the biaffine parser,
to this entity relation extraction task. Being different from the original
dependency parsing model which recognizes dependency relations between words,
we identify relations between groups of words with layout information instead.
We have compared different representations of the semantic entity, different
VRD encoders, and different relation decoders. The results demonstrate that our
proposed model achieves 65.96% F1 score on the FUNSD dataset. As for the
real-world application, our model has been applied to the in-house customs
data, achieving reliable performance in the production setting.
Related papers
- Hypergraph based Understanding for Document Semantic Entity Recognition [65.84258776834524]
We build a novel hypergraph attention document semantic entity recognition framework, HGA, which uses hypergraph attention to focus on entity boundaries and entity categories at the same time.
Our results on FUNSD, CORD, XFUNDIE show that our method can effectively improve the performance of semantic entity recognition tasks.
arXiv Detail & Related papers (2024-07-09T14:35:49Z) - Relation Rectification in Diffusion Model [64.84686527988809]
We introduce a novel task termed Relation Rectification, aiming to refine the model to accurately represent a given relationship it initially fails to generate.
We propose an innovative solution utilizing a Heterogeneous Graph Convolutional Network (HGCN)
The lightweight HGCN adjusts the text embeddings generated by the text encoder, ensuring the accurate reflection of the textual relation in the embedding space.
arXiv Detail & Related papers (2024-03-29T15:54:36Z) - A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction [12.286432133599355]
Document-level Event Argument Extraction (DEAE) aims to identify arguments and their specific roles from an unstructured document.
advanced approaches on DEAE utilize prompt-based methods to guide pre-trained language models (PLMs) in extracting arguments from input documents.
We propose a semantic mention Graph Augmented Model (GAM) to address these two problems in this paper.
arXiv Detail & Related papers (2024-03-12T08:58:07Z) - Modeling Entities as Semantic Points for Visual Information Extraction
in the Wild [55.91783742370978]
We propose an alternative approach to precisely and robustly extract key information from document images.
We explicitly model entities as semantic points, i.e., center points of entities are enriched with semantic information describing the attributes and relationships of different entities.
The proposed method can achieve significantly enhanced performance on entity labeling and linking, compared with previous state-of-the-art models.
arXiv Detail & Related papers (2023-03-23T08:21:16Z) - Relation-Specific Attentions over Entity Mentions for Enhanced
Document-Level Relation Extraction [4.685620089585031]
We propose RSMAN which performs selective attentions over different entity mentions with respect to candidate relations.
Our experiments upon two benchmark datasets show that our RSMAN can bring significant improvements for some backbone models.
arXiv Detail & Related papers (2022-05-28T10:40:31Z) - Document-Level Relation Extraction with Sentences Importance Estimation
and Focusing [52.069206266557266]
Document-level relation extraction (DocRE) aims to determine the relation between two entities from a document of multiple sentences.
We propose a Sentence Estimation and Focusing (SIEF) framework for DocRE, where we design a sentence importance score and a sentence focusing loss.
Experimental results on two domains show that our SIEF not only improves overall performance, but also makes DocRE models more robust.
arXiv Detail & Related papers (2022-04-27T03:20:07Z) - A Trigger-Sense Memory Flow Framework for Joint Entity and Relation
Extraction [5.059120569845976]
We present a Trigger-Sense Memory Flow Framework (TriMF) for joint entity and relation extraction.
We build a memory module to remember category representations learned in entity recognition and relation extraction tasks.
We also design a multi-level memory flow attention mechanism to enhance the bi-directional interaction between entity recognition and relation extraction.
arXiv Detail & Related papers (2021-01-25T16:24:04Z) - Document-Level Relation Extraction with Reconstruction [28.593318203728963]
We propose a novel encoder-classifier-reconstructor model for document-level relation extraction (DocRE)
The reconstructor reconstructs the ground-truth path dependencies from the graph representation, to ensure that the proposed DocRE model pays more attention to encode entity pairs with relationships in the training.
Experimental results on a large-scale DocRE dataset show that the proposed model can significantly improve the accuracy of relation extraction on a strong heterogeneous graph-based baseline.
arXiv Detail & Related papers (2020-12-21T14:29:31Z) - Learning Relation Prototype from Unlabeled Texts for Long-tail Relation
Extraction [84.64435075778988]
We propose a general approach to learn relation prototypes from unlabeled texts.
We learn relation prototypes as an implicit factor between entities.
We conduct experiments on two publicly available datasets: New York Times and Google Distant Supervision.
arXiv Detail & Related papers (2020-11-27T06:21:12Z) - Relation Extraction with Contextualized Relation Embedding (CRE) [6.030060645424665]
This paper proposes an architecture for the relation extraction task that integrates semantic information with knowledge base modeling.
We present a model architecture that internalizes KB modeling in relation extraction.
The proposed CRE model achieves state of the art performance on datasets derived from The New York Times Annotated Corpus and FreeBase.
arXiv Detail & Related papers (2020-11-19T05:19:46Z) - Relation of the Relations: A New Paradigm of the Relation Extraction
Problem [52.21210549224131]
We propose a new paradigm of Relation Extraction (RE) that considers as a whole the predictions of all relations in the same context.
We develop a data-driven approach that does not require hand-crafted rules but learns by itself the relation of relations (RoR) using Graph Neural Networks and a relation matrix transformer.
Experiments show that our model outperforms the state-of-the-art approaches by +1.12% on the ACE05 dataset and +2.55% on SemEval 2018 Task 7.2.
arXiv Detail & Related papers (2020-06-05T22:25:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.