Holistic Image Manipulation Detection using Pixel Co-occurrence Matrices
- URL: http://arxiv.org/abs/2104.05693v1
- Date: Mon, 12 Apr 2021 17:54:42 GMT
- Title: Holistic Image Manipulation Detection using Pixel Co-occurrence Matrices
- Authors: Lakshmanan Nataraj, Michael Goebel, Tajuddin Manhar Mohammed,
Shivkumar Chandrasekaran, B. S. Manjunath
- Abstract summary: Digital image forensics aims to detect images that have been digitally manipulated.
Most detection methods in literature focus on detecting a particular type of manipulation.
We propose a novel approach to holistically detect tampered images using a combination of pixel co-occurrence matrices and deep learning.
- Score: 16.224649756613655
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Digital image forensics aims to detect images that have been digitally
manipulated. Realistic image forgeries involve a combination of splicing,
resampling, region removal, smoothing and other manipulation methods. While
most detection methods in literature focus on detecting a particular type of
manipulation, it is challenging to identify doctored images that involve a host
of manipulations. In this paper, we propose a novel approach to holistically
detect tampered images using a combination of pixel co-occurrence matrices and
deep learning. We extract horizontal and vertical co-occurrence matrices on
three color channels in the pixel domain and train a model using a deep
convolutional neural network (CNN) framework. Our method is agnostic to the
type of manipulation and classifies an image as tampered or untampered. We
train and validate our model on a dataset of more than 86,000 images.
Experimental results show that our approach is promising and achieves more than
0.99 area under the curve (AUC) evaluation metric on the training and
validation subsets. Further, our approach also generalizes well and achieves
around 0.81 AUC on an unseen test dataset comprising more than 19,740 images
released as part of the Media Forensics Challenge (MFC) 2020. Our score was
highest among all other teams that participated in the challenge, at the time
of announcement of the challenge results.
Related papers
- Semi-Supervised Segmentation via Embedding Matching [0.8896991256227597]
Deep convolutional neural networks are widely used in medical image segmentation but require many labeled images for training.
We propose a novel semi-supervised segmentation method that leverages mostly unlabeled images and a small set of labeled images in training.
The proposed approach yielded a Hausdorff distance with 95th percentile (HD95) of 3.30 and IoU of 0.929, surpassing existing methods achieving HD95 (4.07) and IoU (0.927) at their best.
arXiv Detail & Related papers (2024-07-05T16:49:21Z) - Detecting Generated Images by Real Images Only [64.12501227493765]
Existing generated image detection methods detect visual artifacts in generated images or learn discriminative features from both real and generated images by massive training.
This paper approaches the generated image detection problem from a new perspective: Start from real images.
By finding the commonality of real images and mapping them to a dense subspace in feature space, the goal is that generated images, regardless of their generative model, are then projected outside the subspace.
arXiv Detail & Related papers (2023-11-02T03:09:37Z) - Pixel-Inconsistency Modeling for Image Manipulation Localization [63.54342601757723]
Digital image forensics plays a crucial role in image authentication and manipulation localization.
This paper presents a generalized and robust manipulation localization model through the analysis of pixel inconsistency artifacts.
Experiments show that our method successfully extracts inherent pixel-inconsistency forgery fingerprints.
arXiv Detail & Related papers (2023-09-30T02:54:51Z) - Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image
Classification Using Transformers [0.11219061154635457]
Whole-Slide Imaging allows for the capturing and digitization of high-resolution images of histological specimen.
transformer architecture has been proposed as a possible candidate for effectively leveraging the high-resolution information.
We propose a novel cascaded cross-attention network (CCAN) based on the cross-attention mechanism that scales linearly with the number of extracted patches.
arXiv Detail & Related papers (2023-05-11T16:42:24Z) - Enhanced Sharp-GAN For Histopathology Image Synthesis [63.845552349914186]
Histopathology image synthesis aims to address the data shortage issue in training deep learning approaches for accurate cancer detection.
We propose a novel approach that enhances the quality of synthetic images by using nuclei topology and contour regularization.
The proposed approach outperforms Sharp-GAN in all four image quality metrics on two datasets.
arXiv Detail & Related papers (2023-01-24T17:54:01Z) - ObjectFormer for Image Manipulation Detection and Localization [118.89882740099137]
We propose ObjectFormer to detect and localize image manipulations.
We extract high-frequency features of the images and combine them with RGB features as multimodal patch embeddings.
We conduct extensive experiments on various datasets and the results verify the effectiveness of the proposed method.
arXiv Detail & Related papers (2022-03-28T12:27:34Z) - SISL:Self-Supervised Image Signature Learning for Splicing Detection and
Localization [11.437760125881049]
We propose self-supervised approach for training splicing detection/localization models from frequency transforms of images.
Our proposed model can yield similar or better performances on standard datasets without relying on labels or metadata.
arXiv Detail & Related papers (2022-03-15T12:26:29Z) - CNN Filter Learning from Drawn Markers for the Detection of Suggestive
Signs of COVID-19 in CT Images [58.720142291102135]
We propose a method that does not require either large annotated datasets or backpropagation to estimate the filters of a convolutional neural network (CNN)
For a few CT images, the user draws markers at representative normal and abnormal regions.
The method generates a feature extractor composed of a sequence of convolutional layers, whose kernels are specialized in enhancing regions similar to the marked ones.
arXiv Detail & Related papers (2021-11-16T15:03:42Z) - Noise Doesn't Lie: Towards Universal Detection of Deep Inpainting [42.189768203036394]
We make the first attempt towards universal detection of deep inpainting, where the detection network can generalize well.
Our approach outperforms existing detection methods by a large margin and generalizes well to unseen deep inpainting techniques.
arXiv Detail & Related papers (2021-06-03T01:29:29Z) - M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection [74.19291916812921]
forged images generated by Deepfake techniques pose a serious threat to the trustworthiness of digital information.
In this paper, we aim to capture the subtle manipulation artifacts at different scales for Deepfake detection.
We introduce a high-quality Deepfake dataset, SR-DF, which consists of 4,000 DeepFake videos generated by state-of-the-art face swapping and facial reenactment methods.
arXiv Detail & Related papers (2021-04-20T05:43:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.