Source Printer Identification from Document Images Acquired using
Smartphone
- URL: http://arxiv.org/abs/2003.12602v1
- Date: Fri, 27 Mar 2020 18:59:32 GMT
- Title: Source Printer Identification from Document Images Acquired using
Smartphone
- Authors: Sharad Joshi, Suraj Saxena, Nitin Khanna
- Abstract summary: We propose to learn a single CNN model from the fusion of letter images and their printer-specific noise residuals.
The proposed method achieves 98.42% document classification accuracy using images of letter 'e' under a 5x2 cross-validation approach.
- Score: 14.889347839830092
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Vast volumes of printed documents continue to be used for various important
as well as trivial applications. Such applications often rely on the
information provided in the form of printed text documents whose integrity
verification poses a challenge due to time constraints and lack of resources.
Source printer identification provides essential information about the origin
and integrity of a printed document in a fast and cost-effective manner. Even
when fraudulent documents are identified, information about their origin can
help stop future frauds. If a smartphone camera replaces scanner for the
document acquisition process, document forensics would be more economical,
user-friendly, and even faster in many applications where remote and
distributed analysis is beneficial. Building on existing methods, we propose to
learn a single CNN model from the fusion of letter images and their
printer-specific noise residuals. In the absence of any publicly available
dataset, we created a new dataset consisting of 2250 document images of text
documents printed by eighteen printers and acquired by a smartphone camera at
five acquisition settings. The proposed method achieves 98.42% document
classification accuracy using images of letter 'e' under a 5x2 cross-validation
approach. Further, when tested using about half a million letters of all types,
it achieves 90.33% and 98.01% letter and document classification accuracies,
respectively, thus highlighting the ability to learn a discriminative model
without dependence on a single letter type. Also, classification accuracies are
encouraging under various acquisition settings, including low illumination and
change in angle between the document and camera planes.
Related papers
- Watermark Text Pattern Spotting in Document Images [3.6298655794854464]
In the wild, writing can come in various fonts, sizes and forms, making generic recognition a very difficult problem.
We propose a novel benchmark (K-Watermark) containing 65,447 data samples generated using Wrender.
A validity study using humans raters yields an authenticity score of 0.51 against pre-generated watermarked documents.
arXiv Detail & Related papers (2024-01-10T14:02:45Z) - Automatic Recognition of Learning Resource Category in a Digital Library [6.865460045260549]
We introduce the Heterogeneous Learning Resources (HLR) dataset designed for document image classification.
The approach involves decomposing individual learning resources into constituent document images (sheets)
These images are then processed through an OCR tool to extract textual representation.
arXiv Detail & Related papers (2023-11-28T07:48:18Z) - DocMAE: Document Image Rectification via Self-supervised Representation
Learning [144.44748607192147]
We present DocMAE, a novel self-supervised framework for document image rectification.
We first mask random patches of the background-excluded document images and then reconstruct the missing pixels.
With such a self-supervised learning approach, the network is encouraged to learn the intrinsic structure of deformed documents.
arXiv Detail & Related papers (2023-04-20T14:27:15Z) - Deep Unrestricted Document Image Rectification [110.61517455253308]
We present DocTr++, a novel unified framework for document image rectification.
We upgrade the original architecture by adopting a hierarchical encoder-decoder structure for multi-scale representation extraction and parsing.
We contribute a real-world test set and metrics applicable for evaluating the rectification quality.
arXiv Detail & Related papers (2023-04-18T08:00:54Z) - Open Set Classification of Untranscribed Handwritten Documents [56.0167902098419]
Huge amounts of digital page images of important manuscripts are preserved in archives worldwide.
The class or typology'' of a document is perhaps the most important tag to be included in the metadata.
The technical problem is one of automatic classification of documents, each consisting of a set of untranscribed handwritten text images.
arXiv Detail & Related papers (2022-06-20T20:43:50Z) - Fourier Document Restoration for Robust Document Dewarping and
Recognition [73.44057202891011]
This paper presents FDRNet, a Fourier Document Restoration Network that can restore documents with different distortions.
It dewarps documents by a flexible Thin-Plate Spline transformation which can handle various deformations effectively without requiring deformation annotations in training.
It outperforms the state-of-the-art by large margins on both dewarping and text recognition tasks.
arXiv Detail & Related papers (2022-03-18T12:39:31Z) - A Survey on Deep learning based Document Image Enhancement [5.279475826661643]
Digitized documents such as scientific articles, tax forms, invoices, contract papers, and historic texts are widely used nowadays.
These images could be degraded or damaged due to various reasons including poor lighting conditions when capturing the image, shadow while scanning them, distortion like noise and blur, aging, ink stain, bleed through, watermark, stamp, etc.
With recent advances in deep learning, many methods are proposed to enhance the quality of these document images.
arXiv Detail & Related papers (2021-12-06T00:24:50Z) - DocScanner: Robust Document Image Rectification with Progressive
Learning [162.03694280524084]
This work presents DocScanner, a new deep network architecture for document image rectification.
DocScanner maintains a single estimate of the rectified image, which is progressively corrected with a recurrent architecture.
The iterative refinements make DocScanner converge to a robust and superior performance, and the lightweight recurrent architecture ensures the running efficiency.
arXiv Detail & Related papers (2021-10-28T09:15:02Z) - MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document
Analysis [48.35030471041193]
MIDV-2020 consists of 1000 video clips, 2000 scanned images, and 1000 photos of 1000 unique mock identity documents.
With 72409 annotated images in total, to the date of publication the proposed dataset is the largest publicly available identity documents dataset.
arXiv Detail & Related papers (2021-07-01T12:14:17Z) - An Automatic Reader of Identity Documents [0.0]
This paper presents the prototype of a novel automatic reading system of identity documents.
The system has been thought to extract data of the main Italian identity documents from photographs of acceptable quality.
The document is first localized inside the photo, and then classified; finally, text recognition is executed.
arXiv Detail & Related papers (2020-06-26T08:22:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.