Lightweight network towards real-time image denoising on mobile devices
- URL: http://arxiv.org/abs/2211.04687v2
- Date: Thu, 25 May 2023 15:23:27 GMT
- Title: Lightweight network towards real-time image denoising on mobile devices
- Authors: Zhuoqun Liu and Meiguang Jin and Ying Chen and Huaida Liu and Canqian
Yang and Hongkai Xiong
- Abstract summary: Deep convolutional neural networks have achieved great progress in image denoising tasks.
Their complicated architectures and heavy computational cost hinder their deployments on mobile devices.
We propose a mobile-friendly denoising network, namely MFDNet.
- Score: 26.130379174715742
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep convolutional neural networks have achieved great progress in image
denoising tasks. However, their complicated architectures and heavy
computational cost hinder their deployments on mobile devices. Some recent
efforts in designing lightweight denoising networks focus on reducing either
FLOPs (floating-point operations) or the number of parameters. However, these
metrics are not directly correlated with the on-device latency. In this paper,
we identify the real bottlenecks that affect the CNN-based models' run-time
performance on mobile devices: memory access cost and NPU-incompatible
operations, and build the model based on these. To further improve the
denoising performance, the mobile-friendly attention module MFA and the model
reparameterization module RepConv are proposed, which enjoy both low latency
and excellent denoising performance. To this end, we propose a mobile-friendly
denoising network, namely MFDNet. The experiments show that MFDNet achieves
state-of-the-art performance on real-world denoising benchmarks SIDD and DND
under real-time latency on mobile devices. The code and pre-trained models will
be released.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - A Real-Time Voice Activity Detection Based On Lightweight Neural [4.589472292598182]
Voice activity detection (VAD) is the task of detecting speech in an audio stream.
Recent neural network-based VADs have alleviated the degradation of performance to some extent.
We propose a lightweight and real-time neural network called MagicNet, which utilizes casual and depth separable 1-D convolutions and GRU.
arXiv Detail & Related papers (2024-05-27T03:31:16Z) - Latency-aware Unified Dynamic Networks for Efficient Image Recognition [72.8951331472913]
LAUDNet is a framework to bridge the theoretical and practical efficiency gap in dynamic networks.
It integrates three primary dynamic paradigms-spatially adaptive computation, dynamic layer skipping, and dynamic channel skipping.
It can notably reduce the latency of models like ResNet by over 50% on platforms such as V100,3090, and TX2 GPUs.
arXiv Detail & Related papers (2023-08-30T10:57:41Z) - Thunder: Thumbnail based Fast Lightweight Image Denoising Network [92.9631117239565]
A textbfThumbtextbfnail based textbfDtextbfenoising Netwotextbfrk dubbed Thunder is proposed.
arXiv Detail & Related papers (2022-05-24T06:38:46Z) - IMDeception: Grouped Information Distilling Super-Resolution Network [7.6146285961466]
Single-Image-Super-Resolution (SISR) is a classical computer vision problem that has benefited from the recent advancements in deep learning methods.
In this work, we propose the Global Progressive Refinement Module (GPRM) as a less parameter-demanding alternative to the IIC module for feature aggregation.
We also propose Grouped Information Distilling Blocks (GIDB) to further decrease the number of parameters and floating point operations persecond (FLOPS)
Experiments reveal that the proposed network performs on par with state-of-the-art models despite having a limited number of parameters and FLOPS
arXiv Detail & Related papers (2022-04-25T06:43:45Z) - A Study of Designing Compact Audio-Visual Wake Word Spotting System
Based on Iterative Fine-Tuning in Neural Network Pruning [57.28467469709369]
We investigate on designing a compact audio-visual wake word spotting (WWS) system by utilizing visual information.
We introduce a neural network pruning strategy via the lottery ticket hypothesis in an iterative fine-tuning manner (LTH-IF)
The proposed audio-visual system achieves significant performance improvements over the single-modality (audio-only or video-only) system under different noisy conditions.
arXiv Detail & Related papers (2022-02-17T08:26:25Z) - An Adaptive Device-Edge Co-Inference Framework Based on Soft
Actor-Critic [72.35307086274912]
High-dimension parameter model and large-scale mathematical calculation restrict execution efficiency, especially for Internet of Things (IoT) devices.
We propose a new Deep Reinforcement Learning (DRL)-Soft Actor Critic for discrete (SAC-d), which generates the emphexit point, emphexit point, and emphcompressing bits by soft policy iterations.
Based on the latency and accuracy aware reward design, such an computation can well adapt to the complex environment like dynamic wireless channel and arbitrary processing, and is capable of supporting the 5G URL
arXiv Detail & Related papers (2022-01-09T09:31:50Z) - Architecture Aware Latency Constrained Sparse Neural Networks [35.50683537052815]
In this paper, we design an architecture aware latency constrained sparse framework to prune and accelerate CNN models.
We also propose a novel sparse convolution algorithm for efficient computation.
Our system-algorithm co-design framework can achieve much better frontier among network accuracy and latency on resource-constrained mobile devices.
arXiv Detail & Related papers (2021-09-01T03:41:31Z) - Overparametrization of HyperNetworks at Fixed FLOP-Count Enables Fast
Neural Image Enhancement [0.0]
Deep convolutional neural networks can enhance images taken with small mobile camera sensors and excel at tasks like demoisaicing, denoising and super-resolution.
For practical use on mobile devices these networks often require too many FLOPs and reducing the FLOPs of a convolution layer, also reduces its parameter count.
In this paper we propose to use HyperNetworks to break the fixed ratio of FLOPs to parameters of standard convolutions.
arXiv Detail & Related papers (2021-05-18T12:27:05Z) - Learning Frequency-aware Dynamic Network for Efficient Super-Resolution [56.98668484450857]
This paper explores a novel frequency-aware dynamic network for dividing the input into multiple parts according to its coefficients in the discrete cosine transform (DCT) domain.
In practice, the high-frequency part will be processed using expensive operations and the lower-frequency part is assigned with cheap operations to relieve the computation burden.
Experiments conducted on benchmark SISR models and datasets show that the frequency-aware dynamic network can be employed for various SISR neural architectures.
arXiv Detail & Related papers (2021-03-15T12:54:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.