Document Dewarping with Control Points
- URL: http://arxiv.org/abs/2203.10543v1
- Date: Sun, 20 Mar 2022 12:51:14 GMT
- Title: Document Dewarping with Control Points
- Authors: Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, and Cheng-Lin Liu
- Abstract summary: We propose a simple yet effective approach to rectify distorted document image by estimating control points and reference points.
Control points are controllable to facilitate interaction or subsequent adjustment.
Experiments show that our approach can rectify document images with various distortion types, and yield state-of-the-art performance on real-world dataset.
- Score: 36.32190493389662
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Document images are now widely captured by handheld devices such as mobile
phones. The OCR performance on these images are largely affected due to
geometric distortion of the document paper, diverse camera positions and
complex backgrounds. In this paper, we propose a simple yet effective approach
to rectify distorted document image by estimating control points and reference
points. After that, we use interpolation method between control points and
reference points to convert sparse mappings to backward mapping, and remap the
original distorted document image to the rectified image. Furthermore, control
points are controllable to facilitate interaction or subsequent adjustment. We
can flexibly select post-processing methods and the number of vertices
according to different application scenarios. Experiments show that our
approach can rectify document images with various distortion types, and yield
state-of-the-art performance on real-world dataset. This paper also provides a
training dataset based on control points for document dewarping. Both the code
and the dataset are released at
https://github.com/gwxie/Document-Dewarping-with-Control-Points.
Related papers
- DocMAE: Document Image Rectification via Self-supervised Representation
Learning [144.44748607192147]
We present DocMAE, a novel self-supervised framework for document image rectification.
We first mask random patches of the background-excluded document images and then reconstruct the missing pixels.
With such a self-supervised learning approach, the network is encouraged to learn the intrinsic structure of deformed documents.
arXiv Detail & Related papers (2023-04-20T14:27:15Z) - Deep Unrestricted Document Image Rectification [110.61517455253308]
We present DocTr++, a novel unified framework for document image rectification.
We upgrade the original architecture by adopting a hierarchical encoder-decoder structure for multi-scale representation extraction and parsing.
We contribute a real-world test set and metrics applicable for evaluating the rectification quality.
arXiv Detail & Related papers (2023-04-18T08:00:54Z) - ObjectFormer for Image Manipulation Detection and Localization [118.89882740099137]
We propose ObjectFormer to detect and localize image manipulations.
We extract high-frequency features of the images and combine them with RGB features as multimodal patch embeddings.
We conduct extensive experiments on various datasets and the results verify the effectiveness of the proposed method.
arXiv Detail & Related papers (2022-03-28T12:27:34Z) - DocScanner: Robust Document Image Rectification with Progressive
Learning [162.03694280524084]
This work presents DocScanner, a new deep network architecture for document image rectification.
DocScanner maintains a single estimate of the rectified image, which is progressively corrected with a recurrent architecture.
The iterative refinements make DocScanner converge to a robust and superior performance, and the lightweight recurrent architecture ensures the running efficiency.
arXiv Detail & Related papers (2021-10-28T09:15:02Z) - DocTr: Document Image Transformer for Geometric Unwarping and
Illumination Correction [99.09177377916369]
We propose Document Image Transformer (DocTr) to address the issue of geometry and illumination distortion of the document images.
Our DocTr achieves 20.02% Character Error Rate (CER), a 15% absolute improvement over the state-of-the-art methods.
arXiv Detail & Related papers (2021-10-25T13:27:10Z) - Dewarping Document Image By Displacement Flow Estimation with Fully
Convolutional Network [30.18238229156996]
We propose a framework for both rectifying distorted document image and removing background finely, using a fully convolutional network (FCN)
The FCN is trained by regressing displacements of synthesized distorted documents, and to control the smoothness of displacements, we propose a Local Smooth Constraint (LSC) in regularization.
Experiments proved that our approach can dewarp document images effectively under various geometric distortions, and has achieved the state-of-the-art performance in terms of local details and overall effect.
arXiv Detail & Related papers (2021-04-14T12:32:36Z) - RectiNet-v2: A stacked network architecture for document image dewarping [16.249023269158734]
We propose an end-to-end CNN architecture that can produce distortion free document images from warped documents it takes as input.
We train this model on warped document images simulated synthetically to compensate for lack of enough natural data.
We evaluate our method on the DocUNet dataset, a benchmark in this domain, and obtain results comparable to state-of-the-art methods.
arXiv Detail & Related papers (2021-02-01T19:26:17Z) - Can You Read Me Now? Content Aware Rectification using Angle Supervision [14.095728009592763]
We present CREASE: Content Aware Rectification using Angle Supervision, the first learned method for document rectification.
Our method surpasses previous approaches in terms of OCR accuracy, geometric error and visual similarity.
arXiv Detail & Related papers (2020-08-05T16:58:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.