Dual Stream Computer-Generated Image Detection Network Based On Channel
Joint And Softpool
- URL: http://arxiv.org/abs/2207.03205v1
- Date: Thu, 7 Jul 2022 10:19:04 GMT
- Title: Dual Stream Computer-Generated Image Detection Network Based On Channel
Joint And Softpool
- Authors: Ziyi Xi, Hao Lin, Weiqi Luo
- Abstract summary: How to distinguish the computer-generated graphics (CG) from the photographs (PG) has become an important topic in the field of digital image forensics.
This paper proposes a dual stream convolutional neural network based on channel joint and softpool.
Experiments on SPL2018 and DsTok show that the proposed method outperforms existing methods.
- Score: 10.743766528498943
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the development of computer graphics technology, the images synthesized
by computer software become more and more closer to the photographs. While
computer graphics technology brings us a grand visual feast in the field of
games and movies, it may also be utilized by someone with bad intentions to
guide public opinions and cause political crisis or social unrest. Therefore,
how to distinguish the computer-generated graphics (CG) from the photographs
(PG) has become an important topic in the field of digital image forensics.
This paper proposes a dual stream convolutional neural network based on channel
joint and softpool. The proposed network architecture includes a residual
module for extracting image noise information and a joint channel information
extraction module for capturing the shallow semantic information of image. In
addition, we also design a residual structure to enhance feature extraction and
reduce the loss of information in residual flow. The joint channel information
extraction module can obtain the shallow semantic information of the input
image which can be used as the information supplement block of the residual
module. The whole network uses SoftPool to reduce the information loss of
down-sampling for image. Finally, we fuse the two flows to get the
classification results. Experiments on SPL2018 and DsTok show that the proposed
method outperforms existing methods, especially on the DsTok dataset. For
example, the performance of our model surpasses the state-of-the-art by a large
margin of 3%.
Related papers
- A Modular System for Enhanced Robustness of Multimedia Understanding
Networks via Deep Parametric Estimation [30.904034138920057]
In multimedia understanding tasks, corrupted samples pose a critical challenge, because when fed to machine learning models they lead to performance degradation.
We propose SyMPIE to enhance input data for robust downstream multimedia understanding with minimal computational cost.
Our key insight is that most input corruptions can be modeled through global operations on color channels of images or spatial filters with small kernels.
arXiv Detail & Related papers (2024-02-28T15:24:58Z) - A cross Transformer for image denoising [83.68175077524111]
We propose a cross Transformer denoising CNN (CTNet) with a serial block (SB), a parallel block (PB), and a residual block (RB)
CTNet is superior to some popular denoising methods in terms of real and synthetic image denoising.
arXiv Detail & Related papers (2023-10-16T13:53:19Z) - Exploring Effective Mask Sampling Modeling for Neural Image Compression [171.35596121939238]
Most existing neural image compression methods rely on side information from hyperprior or context models to eliminate spatial redundancy.
Inspired by the mask sampling modeling in recent self-supervised learning methods for natural language processing and high-level vision, we propose a novel pretraining strategy for neural image compression.
Our method achieves competitive performance with lower computational complexity compared to state-of-the-art image compression methods.
arXiv Detail & Related papers (2023-06-09T06:50:20Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - Reducing Redundancy in the Bottleneck Representation of the Autoencoders [98.78384185493624]
Autoencoders are a type of unsupervised neural networks, which can be used to solve various tasks.
We propose a scheme to explicitly penalize feature redundancies in the bottleneck representation.
We tested our approach across different tasks: dimensionality reduction using three different dataset, image compression using the MNIST dataset, and image denoising using fashion MNIST.
arXiv Detail & Related papers (2022-02-09T18:48:02Z) - A Compact Neural Network-based Algorithm for Robust Image Watermarking [30.727227627295548]
We propose a novel digital image watermarking solution with a compact neural network, named Invertible Watermarking Network (IWN)
Our IWN architecture is based on a single Invertible Neural Network (INN)
In order to enhance the robustness of our watermarking solution, we specifically introduce a simple but effective bit message normalization module.
arXiv Detail & Related papers (2021-12-27T03:20:45Z) - Image deblurring based on lightweight multi-information fusion network [6.848061582669787]
We propose a lightweight multiinformation fusion network (LMFN) for image deblurring.
In the encoding stage, the image feature is reduced to various smallscale spaces for multi-scale information extraction and fusion.
Then, a distillation network is used in the decoding stage, which allows the network benefit the most from residual learning.
Our network can achieve state-of-the-art image deblurring result with smaller number of parameters and outperforms existing methods in model complexity.
arXiv Detail & Related papers (2021-01-14T00:37:37Z) - Designing and Training of A Dual CNN for Image Denoising [117.54244339673316]
We propose a Dual denoising Network (DudeNet) to recover a clean image.
DudeNet consists of four modules: a feature extraction block, an enhancement block, a compression block, and a reconstruction block.
arXiv Detail & Related papers (2020-07-08T08:16:24Z) - Modeling Lost Information in Lossy Image Compression [72.69327382643549]
Lossy image compression is one of the most commonly used operators for digital images.
We propose a novel invertible framework called Invertible Lossy Compression (ILC) to largely mitigate the information loss problem.
arXiv Detail & Related papers (2020-06-22T04:04:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.