Neural Graph Matching for Modification Similarity Applied to Electronic
Document Comparison
- URL: http://arxiv.org/abs/2204.05486v1
- Date: Tue, 12 Apr 2022 02:37:54 GMT
- Title: Neural Graph Matching for Modification Similarity Applied to Electronic
Document Comparison
- Authors: Po-Fang Hsu, Chiching Wei
- Abstract summary: Document comparison is a common task in the legal and financial industries.
In this paper, we present a novel neural graph matching approach applied to document comparison.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this paper, we present a novel neural graph matching approach applied to
document comparison. Document comparison is a common task in the legal and
financial industries. In some cases, the most important differences may be the
addition or omission of words, sentences, clauses, or paragraphs. However, it
is a challenging task without recording or tracing whole edited process. Under
many temporal uncertainties, we explore the potentiality of our approach to
proximate the accurate comparison to make sure which element blocks have a
relation of edition with others. In beginning, we apply a document layout
analysis that combining traditional and modern technics to segment layout in
blocks of various types appropriately. Then we transform this issue to a
problem of layout graph matching with textual awareness. About graph matching,
it is a long-studied problem with a broad range of applications. However,
different from previous works focusing on visual images or structural layout,
we also bring textual features into our model for adapting this domain.
Specifically, based on the electronic document, we introduce an encoder to deal
with the visual presentation decoding from PDF. Additionally, because the
modifications can cause the inconsistency of document layout analysis between
modified documents and the blocks can be merged and split, Sinkhorn divergence
is adopted in our graph neural approach, which tries to overcome both these
issues with many-to-many block matching. We demonstrate this on two categories
of layouts, as follows., legal agreement and scientific articles, collected
from our real-case datasets.
Related papers
- Contextual Document Embeddings [77.22328616983417]
We propose two complementary methods for contextualized document embeddings.
First, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss.
Second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation.
arXiv Detail & Related papers (2024-10-03T14:33:34Z) - Enhancing Visually-Rich Document Understanding via Layout Structure
Modeling [91.07963806829237]
We propose GraphLM, a novel document understanding model that injects layout knowledge into the model.
We evaluate our model on various benchmarks, including FUNSD, XFUND and CORD, and achieve state-of-the-art results.
arXiv Detail & Related papers (2023-08-15T13:53:52Z) - GVdoc: Graph-based Visual Document Classification [17.350393956461783]
We propose GVdoc, a graph-based document classification model.
Our approach generates a document graph based on its layout, and then trains a graph neural network to learn node and graph embeddings.
We show that our model, even with fewer parameters, outperforms state-of-the-art models on out-of-distribution data.
arXiv Detail & Related papers (2023-05-26T19:23:20Z) - Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text
Documents via Semantic-Oriented Hierarchical Graphs [79.0426838808629]
We propose TAT-DQA, i.e. to answer the question over a visually-rich table-text document.
Specifically, we propose a novel Doc2SoarGraph framework with enhanced discrete reasoning capability.
We conduct extensive experiments on TAT-DQA dataset, and the results show that our proposed framework outperforms the best baseline model by 17.73% and 16.91% in terms of Exact Match (EM) and F1 score respectively on the test set.
arXiv Detail & Related papers (2023-05-03T07:30:32Z) - SelfDocSeg: A Self-Supervised vision-based Approach towards Document
Segmentation [15.953725529361874]
Document layout analysis is a known problem to the documents research community.
With growing internet connectivity to personal life, an enormous amount of documents had been available in the public domain.
We address this challenge using self-supervision and unlike, the few existing self-supervised document segmentation approaches.
arXiv Detail & Related papers (2023-05-01T12:47:55Z) - Entry Separation using a Mixed Visual and Textual Language Model:
Application to 19th century French Trade Directories [18.323615434182553]
A key challenge is to correctly segment what constitutes the basic text regions for the target database.
We propose a new pragmatic approach whose efficiency is demonstrated on 19th century French Trade Directories.
By injecting special visual tokens, coding, for instance, indentation or breaks, into the token stream of the language model used for NER purpose, we can leverage both textual and visual knowledge simultaneously.
arXiv Detail & Related papers (2023-02-17T15:30:44Z) - Page Layout Analysis of Text-heavy Historical Documents: a Comparison of
Textual and Visual Approaches [0.0]
Page layout analysis is a fundamental step in document processing which enables to segment a page into regions of interest.
With highly complex layouts and mixed scripts, scholarly annotated are text-heavy documents which remain challenging for state-of-the-art models.
arXiv Detail & Related papers (2022-12-12T10:10:29Z) - FactGraph: Evaluating Factuality in Summarization with Semantic Graph
Representations [114.94628499698096]
We propose FactGraph, a method that decomposes the document and the summary into structured meaning representations (MRs)
MRs describe core semantic concepts and their relations, aggregating the main content in both document and summary in a canonical form, and reducing data sparsity.
Experiments on different benchmarks for evaluating factuality show that FactGraph outperforms previous approaches by up to 15%.
arXiv Detail & Related papers (2022-04-13T16:45:33Z) - Comprehensive Studies for Arbitrary-shape Scene Text Detection [78.50639779134944]
We propose a unified framework for the bottom-up based scene text detection methods.
Under the unified framework, we ensure the consistent settings for non-core modules.
With the comprehensive investigations and elaborate analyses, it reveals the advantages and disadvantages of previous models.
arXiv Detail & Related papers (2021-07-25T13:18:55Z) - Multilevel Text Alignment with Cross-Document Attention [59.76351805607481]
Existing alignment methods operate at a single, predefined level.
We propose a new learning approach that equips previously established hierarchical attention encoders for representing documents with a cross-document attention component.
arXiv Detail & Related papers (2020-10-03T02:52:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.