Learning JPEG Compression Artifacts for Image Manipulation Detection and
Localization
- URL: http://arxiv.org/abs/2108.12947v1
- Date: Mon, 30 Aug 2021 01:21:07 GMT
- Title: Learning JPEG Compression Artifacts for Image Manipulation Detection and
Localization
- Authors: Myung-Joon Kwon, Seung-Hun Nam, In-Jae Yu, Heung-Kyu Lee, Changick Kim
- Abstract summary: It is essential to distinguish between authentic and tampered regions by analyzing intrinsic statistics in an image.
We propose a convolutional neural network (CNN) that uses discrete cosine transform (DCT) coefficients, where compression artifacts remain, to localize image manipulation.
We show how to design and train a neural network that can learn the distribution of DCT coefficients.
It significantly outperforms traditional and deep neural network-based methods in detecting and localizing tampered regions.
- Score: 26.36646590957593
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Detecting and localizing image manipulation are necessary to counter
malicious use of image editing techniques. Accordingly, it is essential to
distinguish between authentic and tampered regions by analyzing intrinsic
statistics in an image. We focus on JPEG compression artifacts left during
image acquisition and editing. We propose a convolutional neural network (CNN)
that uses discrete cosine transform (DCT) coefficients, where compression
artifacts remain, to localize image manipulation. Standard CNNs cannot learn
the distribution of DCT coefficients because the convolution throws away the
spatial coordinates, which are essential for DCT coefficients. We illustrate
how to design and train a neural network that can learn the distribution of DCT
coefficients. Furthermore, we introduce Compression Artifact Tracing Network
(CAT-Net) that jointly uses image acquisition artifacts and compression
artifacts. It significantly outperforms traditional and deep neural
network-based methods in detecting and localizing tampered regions.
Related papers
- Deep Learning Based Speckle Filtering for Polarimetric SAR Images. Application to Sentinel-1 [51.404644401997736]
We propose a complete framework to remove speckle in polarimetric SAR images using a convolutional neural network.
Experiments show that the proposed approach offers exceptional results in both speckle reduction and resolution preservation.
arXiv Detail & Related papers (2024-08-28T10:07:17Z) - Recompression Based JPEG Tamper Detection and Localization Using Deep Neural Network Eliminating Compression Factor Dependency [2.8498944632323755]
We propose a Convolution Neural Network based deep learning architecture, which is capable of detecting the presence of re compression based forgery in JPEG images.
In this work, we also aim to localize the regions of image manipulation based on re compression features, using the trained neural network.
arXiv Detail & Related papers (2024-07-03T09:19:35Z) - Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression [58.618625678054826]
This study presents an enhanced neural compression method designed for optimal visual fidelity.
We have trained our model with a sophisticated semantic ensemble loss, integrating Charbonnier loss, perceptual loss, style loss, and a non-binary adversarial loss.
Our empirical findings demonstrate that this approach significantly improves the statistical fidelity of neural image compression.
arXiv Detail & Related papers (2024-01-25T08:11:27Z) - A Deep Learning-based Compression and Classification Technique for Whole
Slide Histopathology Images [0.31498833540989407]
We build an ensemble of neural networks that enables a compressive autoencoder in a supervised fashion to retain a denser and more meaningful representation of the input histology images.
We test the compressed images using transfer learning-based classifiers and show that they provide promising accuracy and classification performance.
arXiv Detail & Related papers (2023-05-11T22:20:05Z) - Learned Lossless Compression for JPEG via Frequency-Domain Prediction [50.20577108662153]
We propose a novel framework for learned lossless compression of JPEG images.
To enable learning in the frequency domain, DCT coefficients are partitioned into groups to utilize implicit local redundancy.
An autoencoder-like architecture is designed based on the weight-shared blocks to realize entropy modeling of grouped DCT coefficients.
arXiv Detail & Related papers (2023-03-05T13:15:28Z) - Decoupled Mixup for Generalized Visual Recognition [71.13734761715472]
We propose a novel "Decoupled-Mixup" method to train CNN models for visual recognition.
Our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions to train CNN models.
Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts.
arXiv Detail & Related papers (2022-10-26T15:21:39Z) - Convolutional Neural Network (CNN) to reduce construction loss in JPEG
compression caused by Discrete Fourier Transform (DFT) [0.0]
Convolutional Neural Networks (CNN) have received more attention than most other types of deep neural networks.
In this work, an effective image compression method is purposed using autoencoders.
arXiv Detail & Related papers (2022-08-26T12:46:16Z) - The Devil Is in the Details: Window-based Attention for Image
Compression [58.1577742463617]
Most existing learned image compression models are based on Convolutional Neural Networks (CNNs)
In this paper, we study the effects of multiple kinds of attention mechanisms for local features learning, then introduce a more straightforward yet effective window-based local attention block.
The proposed window-based attention is very flexible which could work as a plug-and-play component to enhance CNN and Transformer models.
arXiv Detail & Related papers (2022-03-16T07:55:49Z) - Joint Global and Local Hierarchical Priors for Learned Image Compression [30.44884350320053]
Recently, learned image compression methods have shown superior performance compared to the traditional hand-crafted image codecs.
We propose a novel entropy model called Information Transformer (Informer) that exploits both local and global information in a content-dependent manner.
Our experiments demonstrate that Informer improves rate-distortion performance over the state-of-the-art methods on the Kodak and Tecnick datasets.
arXiv Detail & Related papers (2021-12-08T06:17:37Z) - CNN Filter Learning from Drawn Markers for the Detection of Suggestive
Signs of COVID-19 in CT Images [58.720142291102135]
We propose a method that does not require either large annotated datasets or backpropagation to estimate the filters of a convolutional neural network (CNN)
For a few CT images, the user draws markers at representative normal and abnormal regions.
The method generates a feature extractor composed of a sequence of convolutional layers, whose kernels are specialized in enhancing regions similar to the marked ones.
arXiv Detail & Related papers (2021-11-16T15:03:42Z) - Less is More: Accelerating Faster Neural Networks Straight from JPEG [1.9214041945441434]
We show how to speed up convolutional neural networks for processing JPEG compressed data.
We exploit learning strategies to reduce the computational complexity by taking full advantage of DCT inputs.
Results show that learning how to combine all DCT inputs in a data-driven fashion is better than discarding them by hand.
arXiv Detail & Related papers (2021-04-01T01:21:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.