Two-stream Encoder-Decoder Network for Localizing Image Forgeries
- URL: http://arxiv.org/abs/2009.12881v1
- Date: Sun, 27 Sep 2020 15:49:17 GMT
- Title: Two-stream Encoder-Decoder Network for Localizing Image Forgeries
- Authors: Aniruddha Mazumdar and Prabin Kumar Bora
- Abstract summary: We propose a novel two-stream encoder-decoder network, which utilizes both the high-level and the low-level image features.
We have carried out experimental analysis on multiple standard forensics datasets to evaluate the performance of the proposed method.
- Score: 4.982505311411925
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes a novel two-stream encoder-decoder network, which
utilizes both the high-level and the low-level image features for precisely
localizing forged regions in a manipulated image. This is motivated from the
fact that the forgery creation process generally introduces both the high-level
artefacts (e.g. unnatural contrast) and the low-level artefacts (e.g. noise
inconsistency) to the forged images. In the proposed two-stream network, one
stream learns the low-level manipulation-related features in the encoder side
by extracting noise residuals through a set of high-pass filters in the first
layer of the encoder network. In the second stream, the encoder learns the
high-level image manipulation features from the input image RGB values. The
coarse feature maps of both the encoders are upsampled by their corresponding
decoder network to produce dense feature maps. The dense feature maps of the
two streams are concatenated and fed to a final convolutional layer with
sigmoidal activation to produce pixel-wise prediction. We have carried out
experimental analysis on multiple standard forensics datasets to evaluate the
performance of the proposed method. The experimental results show the efficacy
of the proposed method with respect to the state-of-the-art.
Related papers
- HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression [51.04820313355164]
HyrbidFlow combines the continuous-feature-based and codebook-based streams to achieve both high perceptual quality and high fidelity under extreme lows.
Experimental results demonstrate superior performance across several datasets under extremely lows.
arXiv Detail & Related papers (2024-04-20T13:19:08Z) - Neuromorphic Synergy for Video Binarization [54.195375576583864]
Bimodal objects serve as a visual form to embed information that can be easily recognized by vision systems.
Neuromorphic cameras offer new capabilities for alleviating motion blur, but it is non-trivial to first de-blur and then binarize the images in a real-time manner.
We propose an event-based binary reconstruction method that leverages the prior knowledge of the bimodal target's properties to perform inference independently in both event space and image space.
We also develop an efficient integration method to propagate this binary image to high frame rate binary video.
arXiv Detail & Related papers (2024-02-20T01:43:51Z) - DocBinFormer: A Two-Level Transformer Network for Effective Document
Image Binarization [17.087982099845156]
Document binarization is a fundamental and crucial step for achieving the most optimal performance in any document analysis task.
We propose DocBinFormer, a novel two-level vision transformer (TL-ViT) architecture based on vision transformers for effective document image binarization.
arXiv Detail & Related papers (2023-12-06T16:01:29Z) - Triple-View Knowledge Distillation for Semi-Supervised Semantic
Segmentation [54.23510028456082]
We propose a Triple-view Knowledge Distillation framework, termed TriKD, for semi-supervised semantic segmentation.
The framework includes the triple-view encoder and the dual-frequency decoder.
arXiv Detail & Related papers (2023-09-22T01:02:21Z) - A Multi-Stream Fusion Network for Image Splicing Localization [18.505512386111985]
We propose an encoder-decoder architecture that consists of multiple encoder streams.
Each stream is fed with either the tampered image or handcrafted signals and processes them separately to capture relevant information from each one independently.
The extracted features from the multiple streams are fused in the bottleneck of the architecture and propagated to the decoder network that generates the output localization map.
arXiv Detail & Related papers (2022-12-02T12:17:53Z) - Towards Complex Backgrounds: A Unified Difference-Aware Decoder for
Binary Segmentation [4.6932442139663015]
A new unified dual-branch decoder paradigm named the difference-aware decoder is proposed in this paper.
The difference-aware decoder imitates the human eye in three stages using the multi-level features output by the encoder.
The results demonstrate that the difference-aware decoder can achieve a higher accuracy than the other state-of-the-art binary segmentation methods.
arXiv Detail & Related papers (2022-10-27T03:45:29Z) - Small Lesion Segmentation in Brain MRIs with Subpixel Embedding [105.1223735549524]
We present a method to segment MRI scans of the human brain into ischemic stroke lesion and normal tissues.
We propose a neural network architecture in the form of a standard encoder-decoder where predictions are guided by a spatial expansion embedding network.
arXiv Detail & Related papers (2021-09-18T00:21:17Z) - Sequential vessel segmentation via deep channel attention network [5.941874421818899]
This paper develops a novel encoder-decoder deep network architecture.
It exploits the several contextual frames of 2D+t sequential images in a sliding window centered at current frame to segment 2D vessel masks from the current frame.
The architecture is equipped with temporal-spatial feature extraction in encoder stage, feature fusion in skip connection layers and channel attention mechanism in decoder stage.
arXiv Detail & Related papers (2021-02-10T02:45:08Z) - Convolutional Autoencoder for Blind Hyperspectral Image Unmixing [0.0]
spectral unmixing is a technique to decompose a mixed pixel into two fundamental representatives: endmembers and abundances.
In this paper, a novel architecture is proposed to perform blind unmixing on hyperspectral images.
arXiv Detail & Related papers (2020-11-18T17:41:31Z) - Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic
Image Segmentation [56.44853893149365]
Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers.
We propose a new architecture based on a decoder which uses a set of shallow networks for capturing more information content.
In order to further improve the architecture we introduce a weight function which aims to re-balance classes to increase the attention of the networks to under-represented objects.
arXiv Detail & Related papers (2020-07-19T18:44:34Z) - Rethinking and Improving Natural Language Generation with Layer-Wise
Multi-View Decoding [59.48857453699463]
In sequence-to-sequence learning, the decoder relies on the attention mechanism to efficiently extract information from the encoder.
Recent work has proposed to use representations from different encoder layers for diversified levels of information.
We propose layer-wise multi-view decoding, where for each decoder layer, together with the representations from the last encoder layer, which serve as a global view, those from other encoder layers are supplemented for a stereoscopic view of the source sequences.
arXiv Detail & Related papers (2020-05-16T20:00:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.