Self-supervised Deep Reconstruction of Mixed Strip-shredded Text
Documents
- URL: http://arxiv.org/abs/2007.00779v1
- Date: Wed, 1 Jul 2020 21:48:05 GMT
- Title: Self-supervised Deep Reconstruction of Mixed Strip-shredded Text
Documents
- Authors: Thiago M. Paix\~ao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro
L. Koerich, Claudine Badue, Alberto F. de Souza, Thiago Oliveira-Santos
- Abstract summary: This work extends our previous deep learning method for single-page reconstruction to a more realistic/complex scenario.
In our approach, the compatibility evaluation is modeled as a two-class (valid or invalid) pattern recognition problem.
The proposed method outperforms the competing ones on complex scenarios, achieving accuracy superior to 90%.
- Score: 63.41717168981103
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The reconstruction of shredded documents consists of coherently arranging
fragments of paper (shreds) to recover the original document(s). A great
challenge in computational reconstruction is to properly evaluate the
compatibility between the shreds. While traditional pixel-based approaches are
not robust to real shredding, more sophisticated solutions compromise
significantly time performance. The solution presented in this work extends our
previous deep learning method for single-page reconstruction to a more
realistic/complex scenario: the reconstruction of several mixed shredded
documents at once. In our approach, the compatibility evaluation is modeled as
a two-class (valid or invalid) pattern recognition problem. The model is
trained in a self-supervised manner on samples extracted from
simulated-shredded documents, which obviates manual annotation. Experimental
results on three datasets -- including a new collection of 100 strip-shredded
documents produced for this work -- have shown that the proposed method
outperforms the competing ones on complex scenarios, achieving accuracy
superior to 90%.
Related papers
- DocMAE: Document Image Rectification via Self-supervised Representation
Learning [144.44748607192147]
We present DocMAE, a novel self-supervised framework for document image rectification.
We first mask random patches of the background-excluded document images and then reconstruct the missing pixels.
With such a self-supervised learning approach, the network is encouraged to learn the intrinsic structure of deformed documents.
arXiv Detail & Related papers (2023-04-20T14:27:15Z) - Geometric Rectification of Creased Document Images based on Isometric
Mapping [0.0]
Geometric rectification of images of distorted documents finds wide applications in document digitization and Optical Character Recognition (OCR)
We propose a general framework of document image rectification in which a computational isometric mapping model is utilized for expressing a 3D document model and its flattening in the plane.
Experiments and comparisons to the state-of-the-art approaches demonstrated the effectiveness and outstanding performance of the proposed method.
arXiv Detail & Related papers (2022-12-16T09:33:31Z) - HandFlow: Quantifying View-Dependent 3D Ambiguity in Two-Hand
Reconstruction with Normalizing Flow [73.7895717883622]
We explicitly model the distribution of plausible reconstructions in a conditional normalizing flow framework.
We show that explicit ambiguity modeling is better-suited for this challenging problem.
arXiv Detail & Related papers (2022-10-04T15:42:22Z) - CODER: An efficient framework for improving retrieval through
COntextualized Document Embedding Reranking [11.635294568328625]
We present a framework for improving the performance of a wide class of retrieval models at minimal computational cost.
It utilizes precomputed document representations extracted by a base dense retrieval method.
It incurs a negligible computational overhead on top of any first-stage method at run time, allowing it to be easily combined with any state-of-the-art dense retrieval method.
arXiv Detail & Related papers (2021-12-16T10:25:26Z) - Multi-Fact Correction in Abstractive Text Summarization [98.27031108197944]
Span-Fact is a suite of two factual correction models that leverages knowledge learned from question answering models to make corrections in system-generated summaries via span selection.
Our models employ single or multi-masking strategies to either iteratively or auto-regressively replace entities in order to ensure semantic consistency w.r.t. the source text.
Experiments show that our models significantly boost the factual consistency of system-generated summaries without sacrificing summary quality in terms of both automatic metrics and human evaluation.
arXiv Detail & Related papers (2020-10-06T02:51:02Z) - A Divide-and-Conquer Approach to the Summarization of Long Documents [4.863209463405628]
We present a novel divide-and-conquer method for the neural summarization of long documents.
Our method exploits the discourse structure of the document and uses sentence similarity to split the problem into smaller summarization problems.
We demonstrate that this approach paired with different summarization models, including sequence-to-sequence RNNs and Transformers, can lead to improved summarization performance.
arXiv Detail & Related papers (2020-04-13T20:38:49Z) - Pre-training for Abstractive Document Summarization by Reinstating
Source Text [105.77348528847337]
This paper presents three pre-training objectives which allow us to pre-train a Seq2Seq based abstractive summarization model on unlabeled text.
Experiments on two benchmark summarization datasets show that all three objectives can improve performance upon baselines.
arXiv Detail & Related papers (2020-04-04T05:06:26Z) - Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised
Deep Asymmetric Metric Learning [62.34197797857823]
A central problem in automatic reconstruction of shredded documents is the pairwise compatibility evaluation of the shreds.
This work proposes a scalable deep learning approach for measuring pairwise compatibility in which the number of inferences scales linearly.
Our method has accuracy comparable to the state-of-the-art with a speed-up of about 22 times for a test instance with 505 shreds.
arXiv Detail & Related papers (2020-03-23T03:22:06Z) - Text Extraction and Restoration of Old Handwritten Documents [3.514869837986596]
This paper describes two novel methods for the restoration of old degraded handwritten documents using deep neural network.
A small-scale dataset of 26 heritage letters images is introduced.
Experiments demonstrate that the proposed systems perform well on handwritten document images with severe degradations.
arXiv Detail & Related papers (2020-01-23T05:42:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.