Boosting the Performance of Video Compression Artifact Reduction with
Reference Frame Proposals and Frequency Domain Information
- URL: http://arxiv.org/abs/2105.14962v1
- Date: Mon, 31 May 2021 13:46:11 GMT
- Title: Boosting the Performance of Video Compression Artifact Reduction with
Reference Frame Proposals and Frequency Domain Information
- Authors: Yi Xu, Minyi Zhao, Jing Liu, Xinjian Zhang, Longwen Gao, Shuigeng
Zhou, Huyang Sun
- Abstract summary: We propose an effective reference frame proposal strategy to boost the performance of the existing multi-frame approaches.
Experimental results show that our method achieves better fidelity and perceptual performance on MFQE 2.0 dataset than the state-of-the-art methods.
- Score: 31.053879834073502
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Many deep learning based video compression artifact removal algorithms have
been proposed to recover high-quality videos from low-quality compressed
videos. Recently, methods were proposed to mine spatiotemporal information via
utilizing multiple neighboring frames as reference frames. However, these
post-processing methods take advantage of adjacent frames directly, but neglect
the information of the video itself, which can be exploited. In this paper, we
propose an effective reference frame proposal strategy to boost the performance
of the existing multi-frame approaches. Besides, we introduce a loss based on
fast Fourier transformation~(FFT) to further improve the effectiveness of
restoration. Experimental results show that our method achieves better fidelity
and perceptual performance on MFQE 2.0 dataset than the state-of-the-art
methods. And our method won Track 1 and Track 2, and was ranked the 2nd in
Track 3 of NTIRE 2021 Quality enhancement of heavily compressed videos
Challenge.
Related papers
- FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality [58.80996741843102]
FasterCache is a training-free strategy designed to accelerate the inference of video diffusion models with high-quality generation.
We show that FasterCache can significantly accelerate video generation while keeping video quality comparable to the baseline.
arXiv Detail & Related papers (2024-10-25T07:24:38Z) - DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models [9.145545884814327]
This paper introduces a method for zero-shot video restoration using pre-trained image restoration diffusion models.
We show that our method achieves top performance in zero-shot video restoration.
Our technique works with any 2D restoration diffusion model, offering a versatile and powerful tool for video enhancement tasks without extensive retraining.
arXiv Detail & Related papers (2024-07-01T17:59:12Z) - Implicit Neural Representation for Videos Based on Residual Connection [0.0]
We propose a method that uses low-resolution frames as residual connection that is considered effective for image reconstruction.
Experimental results show that our method outperforms the existing method, HNeRV, in PSNR for 46 of the 49 videos.
arXiv Detail & Related papers (2024-06-15T10:10:48Z) - Blurry Video Compression: A Trade-off between Visual Enhancement and
Data Compression [65.8148169700705]
Existing video compression (VC) methods primarily aim to reduce the spatial and temporal redundancies between consecutive frames in a video.
Previous works have achieved remarkable results on videos acquired under specific settings such as instant (known) exposure time and shutter speed.
In this work, we tackle the VC problem in a general scenario where a given video can be blurry due to predefined camera settings or dynamics in the scene.
arXiv Detail & Related papers (2023-11-08T02:17:54Z) - Perceptual Quality Improvement in Videoconferencing using
Keyframes-based GAN [28.773037051085318]
We propose a novel GAN-based method for compression artifacts reduction in videoconferencing.
First, we extract multi-scale features from the compressed and reference frames.
Then, our architecture combines these features in a progressive manner according to facial landmarks.
arXiv Detail & Related papers (2023-11-07T16:38:23Z) - Valid Information Guidance Network for Compressed Video Quality
Enhancement [10.294638746269298]
We propose a unique Valid Information Guidance scheme (VIG) to enhance the quality of compressed videos.
Our method achieves the state-of-the-art performance of compressed video quality enhancement in terms of accuracy and efficiency.
arXiv Detail & Related papers (2023-02-28T05:43:25Z) - Leveraging Video Coding Knowledge for Deep Video Enhancement [6.746400031322727]
This study proposes a novel framework that leverages the low-delay configuration of video compression to enhance the existing state-of-the-art method, BasicVSR++.
We incorporate a context-adaptive video fusion method to enhance the final quality of compressed videos.
The proposed approach has been evaluated in the NTIRE22 challenge, a benchmark for video restoration and enhancement, and achieved improvements in both quantitative metrics and visual quality compared to the previous method.
arXiv Detail & Related papers (2023-02-27T09:00:29Z) - Deep Unsupervised Key Frame Extraction for Efficient Video
Classification [63.25852915237032]
This work presents an unsupervised method to retrieve the key frames, which combines Convolutional Neural Network (CNN) and Temporal Segment Density Peaks Clustering (TSDPC)
The proposed TSDPC is a generic and powerful framework and it has two advantages compared with previous works, one is that it can calculate the number of key frames automatically.
Furthermore, a Long Short-Term Memory network (LSTM) is added on the top of the CNN to further elevate the performance of classification.
arXiv Detail & Related papers (2022-11-12T20:45:35Z) - Learned Video Compression via Heterogeneous Deformable Compensation
Network [78.72508633457392]
We propose a learned video compression framework via heterogeneous deformable compensation strategy (HDCVC) to tackle the problems of unstable compression performance.
More specifically, the proposed algorithm extracts features from the two adjacent frames to estimate content-Neighborhood heterogeneous deformable (HetDeform) kernel offsets.
Experimental results indicate that HDCVC achieves superior performance than the recent state-of-the-art learned video compression approaches.
arXiv Detail & Related papers (2022-07-11T02:31:31Z) - COMISR: Compression-Informed Video Super-Resolution [76.94152284740858]
Most videos on the web or mobile devices are compressed, and the compression can be severe when the bandwidth is limited.
We propose a new compression-informed video super-resolution model to restore high-resolution content without introducing artifacts caused by compression.
arXiv Detail & Related papers (2021-05-04T01:24:44Z) - Multi-level Wavelet-based Generative Adversarial Network for Perceptual
Quality Enhancement of Compressed Video [51.631731922593225]
Existing methods mainly focus on enhancing the objective quality of compressed video while ignoring its perceptual quality.
We propose a novel generative adversarial network (GAN) based on multi-level wavelet packet transform (WPT) to enhance the perceptual quality of compressed video.
arXiv Detail & Related papers (2020-08-02T15:01:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.