AlphaVC: High-Performance and Efficient Learned Video Compression
- URL: http://arxiv.org/abs/2207.14678v1
- Date: Fri, 29 Jul 2022 13:52:44 GMT
- Title: AlphaVC: High-Performance and Efficient Learned Video Compression
- Authors: Yibo Shi, Yunying Ge, Jing Wang, Jue Mao
- Abstract summary: We introduce conditional-I-frame as the first frame in the GoP, which stabilizes the reconstructed quality and saves the bit-rate.
Second, to efficiently improve the accuracy of inter prediction without increasing the complexity of decoder, we propose a pixel-to-feature motion prediction method at encoder side.
Third, we propose a probability-based entropy skipping method, which not only brings performance gain, but also greatly reduces the runtime of entropy coding.
- Score: 4.807439168741098
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, learned video compression has drawn lots of attention and show a
rapid development trend with promising results. However, the previous works
still suffer from some criticial issues and have a performance gap with
traditional compression standards in terms of widely used PSNR metric. In this
paper, we propose several techniques to effectively improve the performance.
First, to address the problem of accumulative error, we introduce a
conditional-I-frame as the first frame in the GoP, which stabilizes the
reconstructed quality and saves the bit-rate. Second, to efficiently improve
the accuracy of inter prediction without increasing the complexity of decoder,
we propose a pixel-to-feature motion prediction method at encoder side that
helps us to obtain high-quality motion information. Third, we propose a
probability-based entropy skipping method, which not only brings performance
gain, but also greatly reduces the runtime of entropy coding. With these
powerful techniques, this paper proposes AlphaVC, a high-performance and
efficient learned video compression scheme. To the best of our knowledge,
AlphaVC is the first E2E AI codec that exceeds the latest compression standard
VVC on all common test datasets for both PSNR (-28.2% BD-rate saving) and
MSSSIM (-52.2% BD-rate saving), and has very fast encoding (0.001x VVC) and
decoding (1.69x VVC) speeds.
Related papers
- Accelerating Learned Video Compression via Low-Resolution Representation Learning [18.399027308582596]
We introduce an efficiency-optimized framework for learned video compression that focuses on low-resolution representation learning.
Our method achieves performance levels on par with the low-decay P configuration of the H.266 reference software VTM.
arXiv Detail & Related papers (2024-07-23T12:02:57Z) - Blurry Video Compression: A Trade-off between Visual Enhancement and
Data Compression [65.8148169700705]
Existing video compression (VC) methods primarily aim to reduce the spatial and temporal redundancies between consecutive frames in a video.
Previous works have achieved remarkable results on videos acquired under specific settings such as instant (known) exposure time and shutter speed.
In this work, we tackle the VC problem in a general scenario where a given video can be blurry due to predefined camera settings or dynamics in the scene.
arXiv Detail & Related papers (2023-11-08T02:17:54Z) - Leveraging progressive model and overfitting for efficient learned image
compression [14.937446839215868]
We introduce a powerful and flexible LIC framework with multi-scale progressive (MSP) probability model and latent representation overfitting (LOF) technique.
With different predefined profiles, the proposed framework can achieve various balance points between compression efficiency and computational complexity.
Experiments show that the proposed framework achieves 2.5%, 1.0%, and 1.3% Bjontegaard delta bit rate (BD-rate) reduction over the VVC/H.266 standard.
arXiv Detail & Related papers (2022-10-08T21:54:58Z) - Perceptual Learned Video Compression with Recurrent Conditional GAN [158.0726042755]
We propose a Perceptual Learned Video Compression (PLVC) approach with recurrent conditional generative adversarial network.
PLVC learns to compress video towards good perceptual quality at low bit-rate.
The user study further validates the outstanding perceptual performance of PLVC in comparison with the latest learned video compression approaches.
arXiv Detail & Related papers (2021-09-07T13:36:57Z) - ELF-VC: Efficient Learned Flexible-Rate Video Coding [61.10102916737163]
We propose several novel ideas for learned video compression which allow for improved performance for the low-latency mode.
We benchmark our method, which we call ELF-VC, on popular video test sets UVG and MCL-JCV.
Our approach runs at least 5x faster and has fewer parameters than all ML codecs which report these figures.
arXiv Detail & Related papers (2021-04-29T17:50:35Z) - Conditional Entropy Coding for Efficient Video Compression [82.35389813794372]
We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames.
We first show that a simple architecture modeling the entropy between the image latent codes is as competitive as other neural video compression works and video codecs.
We then propose a novel internal learning extension on top of this architecture that brings an additional 10% savings without trading off decoding speed.
arXiv Detail & Related papers (2020-08-20T20:01:59Z) - Content Adaptive and Error Propagation Aware Deep Video Compression [110.31693187153084]
We propose a content adaptive and error propagation aware video compression system.
Our method employs a joint training strategy by considering the compression performance of multiple consecutive frames instead of a single frame.
Instead of using the hand-crafted coding modes in the traditional compression systems, we design an online encoder updating scheme in our system.
arXiv Detail & Related papers (2020-03-25T09:04:24Z) - Learning for Video Compression with Hierarchical Quality and Recurrent
Enhancement [164.7489982837475]
We propose a Hierarchical Learned Video Compression (HLVC) method with three hierarchical quality layers and a recurrent enhancement network.
In our HLVC approach, the hierarchical quality benefits the coding efficiency, since the high quality information facilitates the compression and enhancement of low quality frames at encoder and decoder sides.
arXiv Detail & Related papers (2020-03-04T09:31:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.