EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
- URL: http://arxiv.org/abs/2210.00708v2
- Date: Tue, 4 Jul 2023 13:28:48 GMT
- Title: EraseNet: A Recurrent Residual Network for Supervised Document Cleaning
- Authors: Yashowardhan Shinde, Kishore Kulkarni, Sachin Kuberkar
- Abstract summary: This paper introduces a supervised approach for cleaning dirty documents using a new fully convolutional auto-encoder architecture.
The experiments in this paper have shown promising results as the model is able to learn a variety of ordinary as well as unusual noises and rectify them efficiently.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Document denoising is considered one of the most challenging tasks in
computer vision. There exist millions of documents that are still to be
digitized, but problems like document degradation due to natural and man-made
factors make this task very difficult. This paper introduces a supervised
approach for cleaning dirty documents using a new fully convolutional
auto-encoder architecture. This paper focuses on restoring documents with
discrepancies like deformities caused due to aging of a document, creases left
on the pages that were xeroxed, random black patches, lightly visible text,
etc., and also improving the quality of the image for better optical character
recognition system (OCR) performance. Removing noise from scanned documents is
a very important step before the documents as this noise can severely affect
the performance of an OCR system. The experiments in this paper have shown
promising results as the model is able to learn a variety of ordinary as well
as unusual noises and rectify them efficiently.
Related papers
- DocMAE: Document Image Rectification via Self-supervised Representation
Learning [144.44748607192147]
We present DocMAE, a novel self-supervised framework for document image rectification.
We first mask random patches of the background-excluded document images and then reconstruct the missing pixels.
With such a self-supervised learning approach, the network is encouraged to learn the intrinsic structure of deformed documents.
arXiv Detail & Related papers (2023-04-20T14:27:15Z) - Deep Unrestricted Document Image Rectification [110.61517455253308]
We present DocTr++, a novel unified framework for document image rectification.
We upgrade the original architecture by adopting a hierarchical encoder-decoder structure for multi-scale representation extraction and parsing.
We contribute a real-world test set and metrics applicable for evaluating the rectification quality.
arXiv Detail & Related papers (2023-04-18T08:00:54Z) - ShabbyPages: A Reproducible Document Denoising and Binarization Dataset [59.457999432618614]
ShabbyPages is a new document image dataset designed for training and benchmarking document denoisers and binarizers.
In this paper, we discuss the creation process of ShabbyPages and demonstrate the utility of ShabbyPages by training convolutional denoisers which remove real noise features with a high degree of human-perceptible fidelity.
arXiv Detail & Related papers (2023-03-16T14:19:50Z) - Detection Masking for Improved OCR on Noisy Documents [8.137198664755596]
We present an improved detection network with a masking system to improve the quality of OCR performed on documents.
We perform a unified evaluation on a publicly available dataset demonstrating the usefulness and broad applicability of our method.
arXiv Detail & Related papers (2022-05-17T11:59:18Z) - Fourier Document Restoration for Robust Document Dewarping and
Recognition [73.44057202891011]
This paper presents FDRNet, a Fourier Document Restoration Network that can restore documents with different distortions.
It dewarps documents by a flexible Thin-Plate Spline transformation which can handle various deformations effectively without requiring deformation annotations in training.
It outperforms the state-of-the-art by large margins on both dewarping and text recognition tasks.
arXiv Detail & Related papers (2022-03-18T12:39:31Z) - A Survey on Deep learning based Document Image Enhancement [5.279475826661643]
Digitized documents such as scientific articles, tax forms, invoices, contract papers, and historic texts are widely used nowadays.
These images could be degraded or damaged due to various reasons including poor lighting conditions when capturing the image, shadow while scanning them, distortion like noise and blur, aging, ink stain, bleed through, watermark, stamp, etc.
With recent advances in deep learning, many methods are proposed to enhance the quality of these document images.
arXiv Detail & Related papers (2021-12-06T00:24:50Z) - DocScanner: Robust Document Image Rectification with Progressive
Learning [162.03694280524084]
This work presents DocScanner, a new deep network architecture for document image rectification.
DocScanner maintains a single estimate of the rectified image, which is progressively corrected with a recurrent architecture.
The iterative refinements make DocScanner converge to a robust and superior performance, and the lightweight recurrent architecture ensures the running efficiency.
arXiv Detail & Related papers (2021-10-28T09:15:02Z) - Enhance to Read Better: An Improved Generative Adversarial Network for
Handwritten Document Image Enhancement [1.7491858164568674]
We propose an end to end architecture based on Generative Adversarial Networks (GANs) to recover degraded documents into a clean and readable form.
To the best of our knowledge, this is the first work to use the text information while binarizing handwritten documents.
We outperform the state of the art in H-DIBCO 2018 challenge, after fine tuning our pre-trained model with synthetically degraded Latin handwritten images.
arXiv Detail & Related papers (2021-05-26T17:44:45Z) - End-to-End Unsupervised Document Image Blind Denoising [0.8717253904965373]
We propose a unified end-to-end unsupervised deep learning model, for the first time, that can effectively remove multiple types of noise.
We demonstrate that the proposed model significantly improves the quality of scanned images and the OCR of the pages on several test datasets.
arXiv Detail & Related papers (2021-05-19T23:55:15Z) - DE-GAN: A Conditional Generative Adversarial Network for Document
Enhancement [4.073826298938431]
We propose an end-to-end framework named Document Enhancement Geneversarative Adrial Networks (DE-GAN) to restore severely degraded document images.
We demonstrate that, in different tasks (document clean up, binarization, deblurring and watermark removal), DE-GAN can produce an enhanced version of the degraded document with a high quality.
arXiv Detail & Related papers (2020-10-17T10:54:49Z) - Self-supervised Deep Reconstruction of Mixed Strip-shredded Text
Documents [63.41717168981103]
This work extends our previous deep learning method for single-page reconstruction to a more realistic/complex scenario.
In our approach, the compatibility evaluation is modeled as a two-class (valid or invalid) pattern recognition problem.
The proposed method outperforms the competing ones on complex scenarios, achieving accuracy superior to 90%.
arXiv Detail & Related papers (2020-07-01T21:48:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.