Perceptual Quality Assessment of Face Video Compression: A Benchmark and
An Effective Method
- URL: http://arxiv.org/abs/2304.07056v3
- Date: Sun, 29 Oct 2023 14:06:56 GMT
- Title: Perceptual Quality Assessment of Face Video Compression: A Benchmark and
An Effective Method
- Authors: Yixuan Li, Bolin Chen, Baoliang Chen, Meng Wang, Shiqi Wang, Weisi Lin
- Abstract summary: Generative coding approaches have been identified as promising alternatives with reasonable perceptual rate-distortion trade-offs.
The great diversity of distortion types in spatial and temporal domains, ranging from the traditional hybrid coding frameworks to generative models, present grand challenges in compressed face video quality assessment (VQA)
We introduce the large-scale Compressed Face Video Quality Assessment (CFVQA) database, which is the first attempt to systematically understand the perceptual quality and diversified compression distortions in face videos.
- Score: 69.868145936998
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent years have witnessed an exponential increase in the demand for face
video compression, and the success of artificial intelligence has expanded the
boundaries beyond traditional hybrid video coding. Generative coding approaches
have been identified as promising alternatives with reasonable perceptual
rate-distortion trade-offs, leveraging the statistical priors of face videos.
However, the great diversity of distortion types in spatial and temporal
domains, ranging from the traditional hybrid coding frameworks to generative
models, present grand challenges in compressed face video quality assessment
(VQA). In this paper, we introduce the large-scale Compressed Face Video
Quality Assessment (CFVQA) database, which is the first attempt to
systematically understand the perceptual quality and diversified compression
distortions in face videos. The database contains 3,240 compressed face video
clips in multiple compression levels, which are derived from 135 source videos
with diversified content using six representative video codecs, including two
traditional methods based on hybrid coding frameworks, two end-to-end methods,
and two generative methods. In addition, a FAce VideO IntegeRity (FAVOR) index
for face video compression was developed to measure the perceptual quality,
considering the distinct content characteristics and temporal priors of the
face videos. Experimental results exhibit its superior performance on the
proposed CFVQA dataset. The benchmark is now made publicly available at:
https://github.com/Yixuan423/Compressed-Face-Videos-Quality-Assessment.
Related papers
- Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency [36.939731355462264]
This study proposes a novel and efficient blind video face enhancement method.
It restores high-quality videos from their compressed low-quality versions with an effective de-flickering mechanism.
Experiments conducted on the VFHQ-Test dataset demonstrate that our method surpasses the current state-of-the-art blind face video restoration and de-flickering methods on both efficiency and effectiveness.
arXiv Detail & Related papers (2024-11-25T15:14:36Z) - Prediction and Reference Quality Adaptation for Learned Video Compression [54.58691829087094]
We propose a confidence-based prediction quality adaptation (PQA) module to provide explicit discrimination for the spatial and channel-wise prediction quality difference.
We also propose a reference quality adaptation (RQA) module and an associated repeat-long training strategy to provide dynamic spatially variant filters for diverse reference qualities.
arXiv Detail & Related papers (2024-06-20T09:03:26Z) - Compression-Realized Deep Structural Network for Video Quality Enhancement [78.13020206633524]
This paper focuses on the task of quality enhancement for compressed videos.
Most of the existing methods lack a structured design to optimally leverage the priors within compression codecs.
A new paradigm is urgently needed for a more conscious'' process of quality enhancement.
arXiv Detail & Related papers (2024-05-10T09:18:17Z) - Perceptual Quality Improvement in Videoconferencing using
Keyframes-based GAN [28.773037051085318]
We propose a novel GAN-based method for compression artifacts reduction in videoconferencing.
First, we extract multi-scale features from the compressed and reference frames.
Then, our architecture combines these features in a progressive manner according to facial landmarks.
arXiv Detail & Related papers (2023-11-07T16:38:23Z) - High Visual-Fidelity Learned Video Compression [6.609832462227998]
We propose a novel High Visual-Fidelity Learned Video Compression framework (HVFVC)
Specifically, we design a novel confidence-based feature reconstruction method to address the issue of poor reconstruction in newly-emerged regions.
Extensive experiments have shown that the proposed HVFVC achieves excellent perceptual quality, outperforming the latest VVC standard with only 50% required.
arXiv Detail & Related papers (2023-10-07T03:27:45Z) - Learned Video Compression via Heterogeneous Deformable Compensation
Network [78.72508633457392]
We propose a learned video compression framework via heterogeneous deformable compensation strategy (HDCVC) to tackle the problems of unstable compression performance.
More specifically, the proposed algorithm extracts features from the two adjacent frames to estimate content-Neighborhood heterogeneous deformable (HetDeform) kernel offsets.
Experimental results indicate that HDCVC achieves superior performance than the recent state-of-the-art learned video compression approaches.
arXiv Detail & Related papers (2022-07-11T02:31:31Z) - Neural Weight Step Video Compression [0.5772546394254112]
In this work, we suggest a set of experiments for testing the feasibility of compressing video using two architectural paradigms.
We propose a novel technique of encoding frames of a video as low-entropy parameter updates.
To assess the feasibility of the considered approaches, we will test the video compression performance on several high-resolution video datasets.
arXiv Detail & Related papers (2021-12-02T18:53:05Z) - Perceptual Learned Video Compression with Recurrent Conditional GAN [158.0726042755]
We propose a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional generative adversarial network.
PLVC learns to compress video towards good perceptual quality at low bit-rate.
The user study further validates the outstanding perceptual performance of PLVC in comparison with the latest learned video compression approaches.
arXiv Detail & Related papers (2021-09-07T13:36:57Z) - COMISR: Compression-Informed Video Super-Resolution [76.94152284740858]
Most videos on the web or mobile devices are compressed, and the compression can be severe when the bandwidth is limited.
We propose a new compression-informed video super-resolution model to restore high-resolution content without introducing artifacts caused by compression.
arXiv Detail & Related papers (2021-05-04T01:24:44Z) - Feedback Recurrent Autoencoder for Video Compression [14.072596106425072]
We propose a new network architecture for learned video compression operating in low latency mode.
Our method yields state of the art MS-SSIM/rate performance on the high-resolution UVG dataset.
arXiv Detail & Related papers (2020-04-09T02:58:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.