Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
- URL: http://arxiv.org/abs/2004.13388v1
- Date: Tue, 28 Apr 2020 09:34:47 GMT
- Title: Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
- Authors: Hang Dong, Jinshan Pan, Lei Xiang, Zhe Hu, Xinyi Zhang, Fei Wang,
Ming-Hsuan Yang
- Abstract summary: We propose a Multi-Scale Boosted Dehazing Network with Dense Feature Fusion based on the U-Net architecture.
We show that the proposed model performs favorably against the state-of-the-art approaches on the benchmark datasets as well as real-world hazy images.
- Score: 92.92572594942071
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a Multi-Scale Boosted Dehazing Network with Dense
Feature Fusion based on the U-Net architecture. The proposed method is designed
based on two principles, boosting and error feedback, and we show that they are
suitable for the dehazing problem. By incorporating the
Strengthen-Operate-Subtract boosting strategy in the decoder of the proposed
model, we develop a simple yet effective boosted decoder to progressively
restore the haze-free image. To address the issue of preserving spatial
information in the U-Net architecture, we design a dense feature fusion module
using the back-projection feedback scheme. We show that the dense feature
fusion module can simultaneously remedy the missing spatial information from
high-resolution features and exploit the non-adjacent features. Extensive
evaluations demonstrate that the proposed model performs favorably against the
state-of-the-art approaches on the benchmark datasets as well as real-world
hazy images.
Related papers
- Addressing Domain Discrepancy: A Dual-branch Collaborative Model to Unsupervised Dehazing [1.6624384368855527]
This paper proposes a novel dual-branch collaborative unpaired dehazing model (DCM-dehaze) to address this issue.
Specifically, we design a dual depthwise separable convolutional module (DDSCM) to enhance the information of deeper features.
In addition, we construct a bidirectional contour function to optimize the edge features of the image to enhance the clarity and fidelity of the image details.
arXiv Detail & Related papers (2024-07-14T14:47:32Z) - Multi-view Aggregation Network for Dichotomous Image Segmentation [76.75904424539543]
Dichotomous Image (DIS) has recently emerged towards high-precision object segmentation from high-resolution natural images.
Existing methods rely on tedious multiple encoder-decoder streams and stages to gradually complete the global localization and local refinement.
Inspired by it, we model DIS as a multi-view object perception problem and provide a parsimonious multi-view aggregation network (MVANet)
Experiments on the popular DIS-5K dataset show that our MVANet significantly outperforms state-of-the-art methods in both accuracy and speed.
arXiv Detail & Related papers (2024-04-11T03:00:00Z) - FeatUp: A Model-Agnostic Framework for Features at Any Resolution [24.4201195336725]
FeatUp is a task- and model-agnostic framework to restore lost spatial information in deep features.
We introduce two variants of FeatUp: one that guides features with high-resolution signal in a single forward pass, and one that fits an implicit model to a single image to reconstruct features at any resolution.
We show that FeatUp significantly outperforms other feature upsampling and image super-resolution approaches in class activation map generation, transfer learning for segmentation and depth prediction, and end-to-end training for semantic segmentation.
arXiv Detail & Related papers (2024-03-15T17:57:06Z) - Diffusion Models Without Attention [110.5623058129782]
Diffusion State Space Model (DiffuSSM) is an architecture that supplants attention mechanisms with a more scalable state space model backbone.
Our focus on FLOP-efficient architectures in diffusion training marks a significant step forward.
arXiv Detail & Related papers (2023-11-30T05:15:35Z) - Searching a Compact Architecture for Robust Multi-Exposure Image Fusion [55.37210629454589]
Two major stumbling blocks hinder the development, including pixel misalignment and inefficient inference.
This study introduces an architecture search-based paradigm incorporating self-alignment and detail repletion modules for robust multi-exposure image fusion.
The proposed method outperforms various competitive schemes, achieving a noteworthy 3.19% improvement in PSNR for general scenarios and an impressive 23.5% enhancement in misaligned scenarios.
arXiv Detail & Related papers (2023-05-20T17:01:52Z) - DepthFormer: Exploiting Long-Range Correlation and Local Information for
Accurate Monocular Depth Estimation [50.08080424613603]
Long-range correlation is essential for accurate monocular depth estimation.
We propose to leverage the Transformer to model this global context with an effective attention mechanism.
Our proposed model, termed DepthFormer, surpasses state-of-the-art monocular depth estimation methods with prominent margins.
arXiv Detail & Related papers (2022-03-27T05:03:56Z) - Global-Local Path Networks for Monocular Depth Estimation with Vertical
CutDepth [24.897377434844266]
We propose a novel structure and training strategy for monocular depth estimation.
We deploy a hierarchical transformer encoder to capture and convey the global context, and design a lightweight yet powerful decoder.
Our network achieves state-of-the-art performance over the challenging depth dataset NYU Depth V2.
arXiv Detail & Related papers (2022-01-19T06:37:21Z) - GridDehazeNet+: An Enhanced Multi-Scale Network with Intra-Task
Knowledge Transfer for Single Image Dehazing [12.982905875008214]
We propose an enhanced multi-scale network, dubbed GridDehazeNet+, for single image dehazing.
It consists of three modules: pre-processing, backbone, and post-processing.
arXiv Detail & Related papers (2021-03-25T17:35:36Z) - MetaSDF: Meta-learning Signed Distance Functions [85.81290552559817]
Generalizing across shapes with neural implicit representations amounts to learning priors over the respective function space.
We formalize learning of a shape space as a meta-learning problem and leverage gradient-based meta-learning algorithms to solve this task.
arXiv Detail & Related papers (2020-06-17T05:14:53Z) - Toward a Controllable Disentanglement Network [22.968760397814993]
This paper addresses two crucial problems of learning disentangled image representations, namely controlling the degree of disentanglement during image editing, and balancing the disentanglement strength and the reconstruction quality.
By exploring the real-valued space of the soft target representation, we are able to synthesize novel images with the designated properties.
arXiv Detail & Related papers (2020-01-22T16:54:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.