Related papers: PocketDVDNet: Realtime Video Denoising for Real Camera Noise

PocketDVDNet: Realtime Video Denoising for Real Camera Noise

URL: http://arxiv.org/abs/2601.16780v1
Date: Fri, 23 Jan 2026 14:27:03 GMT
Title: PocketDVDNet: Realtime Video Denoising for Real Camera Noise
Authors: Crispian Morris, Imogen Dexter, Fan Zhang, David R. Bull, Nantheera Anantrasirichai,
Abstract summary: We propose PocketDVDNet, a lightweight video denoiser developed using our model compression framework.<n>We induce sparsity, apply targeted channel pruning, and retrain a teacher on realistic multi-component noise.<n>PocketDVDNet reduces the original model size by 74% while improving denoising quality and processing 5-frame patches in real-time.
Score: 7.3429091913205164
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Live video denoising under realistic, multi-component sensor noise remains challenging for applications such as autofocus, autonomous driving, and surveillance. We propose PocketDVDNet, a lightweight video denoiser developed using our model compression framework that combines sparsity-guided structured pruning, a physics-informed noise model, and knowledge distillation to achieve high-quality restoration with reduced resource demands. Starting from a reference model, we induce sparsity, apply targeted channel pruning, and retrain a teacher on realistic multi-component noise. The student network learns implicit noise handling, eliminating the need for explicit noise-map inputs. PocketDVDNet reduces the original model size by 74% while improving denoising quality and processing 5-frame patches in real-time. These results demonstrate that aggressive compression, combined with domain-adapted distillation, can reconcile performance and efficiency for practical, real-time video denoising.

Related papers

LLVD: LSTM-based Explicit Motion Modeling in Latent Space for Blind Video Denoising [1.9253333342733672]
This paper introduces a novel algorithm designed for scenarios where noise is introduced during video capture.<n>We propose the Latent space LSTM Video Denoiser (LLVD), an end-to-end blind denoising model.<n> Experiments reveal that LLVD demonstrates excellent performance for both synthetic and captured noise.
arXiv Detail & Related papers (2025-01-10T06:20:27Z)
RViDeformer: Efficient Raw Video Denoising Transformer with a Larger Benchmark Dataset [15.340530514779804]
There is no large dataset with realistic motions for supervised raw video denoising.<n>We construct a video denoising dataset (named as ReCRVD) with 120 groups of noisy-clean videos.<n>We propose an efficient raw video denoising transformer network (RViDeformer) that explores both short and long-distance correlations.
arXiv Detail & Related papers (2023-05-01T11:06:58Z)
Real-time Controllable Denoising for Image and Video [44.68523669975698]
Controllable image denoising aims to generate clean samples with human priors and balance sharpness and smoothness. We introduce Real-time Controllable Denoising (RCD), the first deep image and video denoising pipeline. RCD provides a fully controllable user interface to edit arbitrary denoising levels in real-time with only one-time network inference.
arXiv Detail & Related papers (2023-03-29T03:10:28Z)
Low Latency Video Denoising for Online Conferencing Using CNN Architectures [4.7805617044617446]
We propose a pipeline for real-time video denoising with low runtime cost and high perceptual quality. A custom noise detector analyzer provides real-time feedback to adapt the weights and improve the models' output.
arXiv Detail & Related papers (2023-02-17T00:55:54Z)
Learning Task-Oriented Flows to Mutually Guide Feature Alignment in Synthesized and Real Video Denoising [137.5080784570804]
Video denoising aims at removing noise from videos to recover clean ones. Some existing works show that optical flow can help the denoising by exploiting the additional spatial-temporal clues from nearby frames. We propose a new multi-scale refined optical flow-guided video denoising method, which is more robust to different noise levels.
arXiv Detail & Related papers (2022-08-25T00:09:18Z)
PVDD: A Practical Video Denoising Dataset with Real-World Dynamic Scenes [56.4361151691284]
"Practical Video Denoising dataset" (PVDD) contains 200 noisy-clean dynamic video pairs in both sRGB and RAW format. Compared with existing datasets consisting of limited motion information,PVDD covers dynamic scenes with varying natural motion.
arXiv Detail & Related papers (2022-07-04T12:30:22Z)
Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training [50.018580462619425]
We propose a novel framework, namely Pixel-level Noise-aware Generative Adrial Network (PNGAN) PNGAN employs a pre-trained real denoiser to map the fake and real noisy images into a nearly noise-free solution space. For better noise fitting, we present an efficient architecture Simple Multi-versa-scale Network (SMNet) as the generator.
arXiv Detail & Related papers (2022-04-06T14:09:02Z)
Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis [148.16279746287452]
We propose a swin-conv block to incorporate the local modeling ability of residual convolutional layer and non-local modeling ability of swin transformer block. For the training data synthesis, we design a practical noise degradation model which takes into consideration different kinds of noise. Experiments on AGWN removal and real image denoising demonstrate that the new network architecture design achieves state-of-the-art performance.
arXiv Detail & Related papers (2022-03-24T18:11:31Z)
Neural Compression-Based Feature Learning for Video Restoration [29.021502115116736]
This paper proposes learning noise-robust feature representations to help video restoration. We design a neural compression module to filter the noise and keep the most useful information in features for video restoration.
arXiv Detail & Related papers (2022-03-17T09:59:26Z)
IDR: Self-Supervised Image Denoising via Iterative Data Refinement [66.5510583957863]
We present a practical unsupervised image denoising method to achieve state-of-the-art denoising performance. Our method only requires single noisy images and a noise model, which is easily accessible in practical raw image denoising. To evaluate raw image denoising performance in real-world applications, we build a high-quality raw image dataset SenseNoise-500 that contains 500 real-life scenes.
arXiv Detail & Related papers (2021-11-29T07:22:53Z)
Physics-based Noise Modeling for Extreme Low-light Photography [63.65570751728917]
We study the noise statistics in the imaging pipeline of CMOS photosensors. We formulate a comprehensive noise model that can accurately characterize the real noise structures. Our noise model can be used to synthesize realistic training data for learning-based low-light denoising algorithms.
arXiv Detail & Related papers (2021-08-04T16:36:29Z)
Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising [104.59305271099967]
We present a pixel aggregation network and learn the pixel sampling and averaging strategies for image denoising. We develop a pixel aggregation network for video denoising to sample pixels across the spatial-temporal space. Our method is able to solve the misalignment issues caused by large motion in dynamic scenes.
arXiv Detail & Related papers (2021-01-26T13:00:46Z)

This list is automatically generated from the titles and abstracts of the papers in this site.