DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing
- URL: http://arxiv.org/abs/2312.09955v1
- Date: Fri, 15 Dec 2023 17:05:32 GMT
- Title: DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing
- Authors: Abdul Wasi, O. Jeba Shiney
- Abstract summary: Images acquired in hazy conditions have degradations induced in them.
Prior-based and learning-based approaches have been proposed to mitigate the effect of haze and generate haze-free images.
In this paper, a method that uses residual learning and vision transformers in an attention module is proposed.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Images acquired in hazy conditions have degradations induced in them.
Dehazing such images is a vexed and ill-posed problem. Scores of prior-based
and learning-based approaches have been proposed to mitigate the effect of haze
and generate haze-free images. Many conventional methods are constrained by
their lack of awareness regarding scene depth and their incapacity to capture
long-range dependencies. In this paper, a method that uses residual learning
and vision transformers in an attention module is proposed. It essentially
comprises two networks: In the first one, the network takes the ratio of a hazy
image and the approximated transmission matrix to estimate a residual map. The
second network takes this residual image as input and passes it through
convolution layers before superposing it on the generated feature maps. It is
then passed through global context and depth-aware transformer encoders to
obtain channel attention. The attention module then infers the spatial
attention map before generating the final haze-free image. Experimental
results, including several quantitative metrics, demonstrate the efficiency and
scalability of the suggested methodology.
Related papers
- DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN [60.257791714663725]
We propose a Prior map Guided CycleGAN (PG-CycleGAN) for defogging of images with overwater scenes.
The proposed method outperforms the state-of-the-art supervised, semi-supervised, and unsupervised defogging approaches.
arXiv Detail & Related papers (2022-12-23T03:00:28Z) - dual unet:a novel siamese network for change detection with cascade
differential fusion [4.651756476458979]
We propose a novel Siamese neural network for change detection task, namely Dual-UNet.
In contrast to previous individually encoded the bitemporal images, we design an encoder differential-attention module to focus on the spatial difference relationships of pixels.
Experiments demonstrate that the proposed approach consistently outperforms the most advanced methods on popular seasonal change detection datasets.
arXiv Detail & Related papers (2022-08-12T14:24:09Z) - Adaptive Image Inpainting [43.02281823557039]
Inpainting methods have shown significant improvements by using deep neural networks.
The problem is rooted in the encoder layers' ineffectiveness in building a complete and faithful embedding of the missing regions.
We propose a distillation based approach for inpainting, where we provide direct feature level supervision for the encoder layers.
arXiv Detail & Related papers (2022-01-01T12:16:01Z) - Learning Discriminative Shrinkage Deep Networks for Image Deconvolution [122.79108159874426]
We propose an effective non-blind deconvolution approach by learning discriminative shrinkage functions to implicitly model these terms.
Experimental results show that the proposed method performs favorably against the state-of-the-art ones in terms of efficiency and accuracy.
arXiv Detail & Related papers (2021-11-27T12:12:57Z) - Is Deep Image Prior in Need of a Good Education? [57.3399060347311]
Deep image prior was introduced as an effective prior for image reconstruction.
Despite its impressive reconstructive properties, the approach is slow when compared to learned or traditional reconstruction techniques.
We develop a two-stage learning paradigm to address the computational challenge.
arXiv Detail & Related papers (2021-11-23T15:08:26Z) - Single image dehazing via combining the prior knowledge and CNNs [6.566615606042994]
An end-to-end system is proposed in this paper to reduce defects by combining the prior knowledge and deep learning method.
Experiments show that the proposed method achieves superior performance over existing methods.
arXiv Detail & Related papers (2021-11-10T14:18:25Z) - Non-Homogeneous Haze Removal via Artificial Scene Prior and
Bidimensional Graph Reasoning [52.07698484363237]
We propose a Non-Homogeneous Haze Removal Network (NHRN) via artificial scene prior and bidimensional graph reasoning.
Our method achieves superior performance over many state-of-the-art algorithms for both the single image dehazing and hazy image understanding tasks.
arXiv Detail & Related papers (2021-04-05T13:04:44Z) - Gated Fusion Network for Degraded Image Super Resolution [78.67168802945069]
We propose a dual-branch convolutional neural network to extract base features and recovered features separately.
By decomposing the feature extraction step into two task-independent streams, the dual-branch model can facilitate the training process.
arXiv Detail & Related papers (2020-03-02T13:28:32Z) - Class-Specific Blind Deconvolutional Phase Retrieval Under a Generative
Prior [8.712404218757733]
The problem arises in various imaging modalities such as Fourier ptychography, X-ray crystallography, and in visible light communication.
We propose to solve this inverse problem using alternating gradient descent algorithm under two pretrained deep generative networks as priors.
The proposed recovery algorithm strives to find a sharp image and a blur kernel in the range of the respective pre-generators that textitbest explain the forward measurement model.
arXiv Detail & Related papers (2020-02-28T07:36:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.