Image Generation and Learning Strategy for Deep Document Forgery
Detection
- URL: http://arxiv.org/abs/2311.03650v1
- Date: Tue, 7 Nov 2023 01:40:00 GMT
- Title: Image Generation and Learning Strategy for Deep Document Forgery
Detection
- Authors: Yamato Okamoto, Osada Genki, Iu Yahiro, Rintaro Hasegawa, Peifei Zhu,
Hirokatsu Kataoka
- Abstract summary: Recent advancements in deep neural network (DNN) methods for generative tasks may amplify the threat of document forgery.
We construct a training dataset of document forgery images, named FD-VIED, by emulating possible attacks.
In our experiments, we demonstrate that our approach enhances detection performance.
- Score: 7.585489507445007
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, document processing has flourished and brought numerous
benefits. However, there has been a significant rise in reported cases of
forged document images. Specifically, recent advancements in deep neural
network (DNN) methods for generative tasks may amplify the threat of document
forgery. Traditional approaches for forged document images created by prevalent
copy-move methods are unsuitable against those created by DNN-based methods, as
we have verified. To address this issue, we construct a training dataset of
document forgery images, named FD-VIED, by emulating possible attacks, such as
text addition, removal, and replacement with recent DNN-methods. Additionally,
we introduce an effective pre-training approach through self-supervised
learning with both natural images and document images. In our experiments, we
demonstrate that our approach enhances detection performance.
Related papers
- Multi-modal Document Presentation Attack Detection With Forensics Trace Disentanglement [22.751498009362795]
Document Presentation Attack Detection (DPAD) is an important measure in protecting the authenticity of a document image.
Recent DPAD methods demand additional resources, such as manual effort in collecting additional data or knowing the parameters of acquisition devices.
This work proposes a DPAD method based on multi-modal disentangled traces (MMDT) without the above drawbacks.
arXiv Detail & Related papers (2024-04-10T00:11:03Z) - Active Generation for Image Classification [50.18107721267218]
We propose to address the efficiency of image generation by focusing on the specific needs and characteristics of the model.
With a central tenet of active learning, our method, named ActGen, takes a training-aware approach to image generation.
arXiv Detail & Related papers (2024-03-11T08:45:31Z) - Towards General Visual-Linguistic Face Forgery Detection [95.73987327101143]
Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust.
Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model.
We propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation.
arXiv Detail & Related papers (2023-07-31T10:22:33Z) - DocMAE: Document Image Rectification via Self-supervised Representation
Learning [144.44748607192147]
We present DocMAE, a novel self-supervised framework for document image rectification.
We first mask random patches of the background-excluded document images and then reconstruct the missing pixels.
With such a self-supervised learning approach, the network is encouraged to learn the intrinsic structure of deformed documents.
arXiv Detail & Related papers (2023-04-20T14:27:15Z) - Deep Unrestricted Document Image Rectification [110.61517455253308]
We present DocTr++, a novel unified framework for document image rectification.
We upgrade the original architecture by adopting a hierarchical encoder-decoder structure for multi-scale representation extraction and parsing.
We contribute a real-world test set and metrics applicable for evaluating the rectification quality.
arXiv Detail & Related papers (2023-04-18T08:00:54Z) - Two-branch Multi-scale Deep Neural Network for Generalized Document
Recapture Attack Detection [25.88454144842164]
The image recapture attack is an effective image manipulation method to erase certain forensic traces, and when targeting on personal document images, it poses a great threat to the security of e-commerce and other web applications.
We propose a novel two-branch deep neural network by mining better generalized recapture artifacts with a designed frequency filter bank and multi-scale cross-attention fusion module.
arXiv Detail & Related papers (2022-11-30T06:57:11Z) - Augraphy: A Data Augmentation Library for Document Images [59.457999432618614]
Augraphy is a Python library for constructing data augmentation pipelines.
It provides strategies to produce augmented versions of clean document images that appear to have been altered by standard office operations.
arXiv Detail & Related papers (2022-08-30T22:36:19Z) - DiT: Self-supervised Pre-training for Document Image Transformer [85.78807512344463]
We propose DiT, a self-supervised pre-trained Document Image Transformer model.
We leverage DiT as the backbone network in a variety of vision-based Document AI tasks.
Experiment results have illustrated that the self-supervised pre-trained DiT model achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-03-04T15:34:46Z) - RectiNet-v2: A stacked network architecture for document image dewarping [16.249023269158734]
We propose an end-to-end CNN architecture that can produce distortion free document images from warped documents it takes as input.
We train this model on warped document images simulated synthetically to compensate for lack of enough natural data.
We evaluate our method on the DocUNet dataset, a benchmark in this domain, and obtain results comparable to state-of-the-art methods.
arXiv Detail & Related papers (2021-02-01T19:26:17Z) - Multiple Document Datasets Pre-training Improves Text Line Detection
With Deep Neural Networks [2.5352713493505785]
We introduce a fully convolutional network for the document layout analysis task.
Our method Doc-UFCN relies on a U-shaped model trained from scratch for detecting objects from historical documents.
We show that Doc-UFCN outperforms state-of-the-art methods on various datasets.
arXiv Detail & Related papers (2020-12-28T09:48:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.