CBANet: Towards Complexity and Bitrate Adaptive Deep Image Compression
using a Single Network
- URL: http://arxiv.org/abs/2105.12386v1
- Date: Wed, 26 May 2021 08:13:56 GMT
- Title: CBANet: Towards Complexity and Bitrate Adaptive Deep Image Compression
using a Single Network
- Authors: Jinyang Guo, Dong Xu, Guo Lu
- Abstract summary: We propose a new deep image compression framework called Complexity and Bitrate Adaptive Network (CBANet)
Our CBANet considers the trade-off between the rate and distortion under dynamic computational complexity constraints.
As a result, our CBANet enables one single to support multiple decoding under various computational complexity constraints.
- Score: 24.418215098116335
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we propose a new deep image compression framework called
Complexity and Bitrate Adaptive Network (CBANet), which aims to learn one
single network to support variable bitrate coding under different computational
complexity constraints. In contrast to the existing state-of-the-art learning
based image compression frameworks that only consider the rate-distortion
trade-off without introducing any constraint related to the computational
complexity, our CBANet considers the trade-off between the rate and distortion
under dynamic computational complexity constraints. Specifically, to decode the
images with one single decoder under various computational complexity
constraints, we propose a new multi-branch complexity adaptive module, in which
each branch only takes a small portion of the computational budget of the
decoder. The reconstructed images with different visual qualities can be
readily generated by using different numbers of branches. Furthermore, to
achieve variable bitrate decoding with one single decoder, we propose a bitrate
adaptive module to project the representation from a base bitrate to the
expected representation at a target bitrate for transmission. Then it will
project the transmitted representation at the target bitrate back to that at
the base bitrate for the decoding process. The proposed bit adaptive module can
significantly reduce the storage requirement for deployment platforms. As a
result, our CBANet enables one single codec to support multiple bitrate
decoding under various computational complexity constraints. Comprehensive
experiments on two benchmark datasets demonstrate the effectiveness of our
CBANet for deep image compression.
Related papers
- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression [47.72668401825835]
We propose a rate-distortion-cognition controllable versatile image compression method.
Our method yields satisfactory ICM performance and flexible Rate-DistortionCognition controlling.
arXiv Detail & Related papers (2024-07-16T13:17:51Z) - AdaBM: On-the-Fly Adaptive Bit Mapping for Image Super-Resolution [53.23803932357899]
We introduce the first on-the-fly adaptive quantization framework that accelerates the processing time from hours to seconds.
We achieve competitive performance with the previous adaptive quantization methods, while the processing time is accelerated by x2000.
arXiv Detail & Related papers (2024-04-04T08:37:27Z) - MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model [78.4051835615796]
This paper proposes a method called Multimodal Image Semantic Compression.
It consists of an LMM encoder for extracting the semantic information of the image, a map encoder to locate the region corresponding to the semantic, an image encoder generates an extremely compressed bitstream, and a decoder reconstructs the image based on the above information.
It can achieve optimal consistency and perception results while saving perceptual 50%, which has strong potential applications in the next generation of storage and communication.
arXiv Detail & Related papers (2024-02-26T17:11:11Z) - Progressive Learning with Visual Prompt Tuning for Variable-Rate Image
Compression [60.689646881479064]
We propose a progressive learning paradigm for transformer-based variable-rate image compression.
Inspired by visual prompt tuning, we use LPM to extract prompts for input images and hidden features at the encoder side and decoder side, respectively.
Our model outperforms all current variable image methods in terms of rate-distortion performance and approaches the state-of-the-art fixed image compression methods trained from scratch.
arXiv Detail & Related papers (2023-11-23T08:29:32Z) - Computationally-Efficient Neural Image Compression with Shallow Decoders [43.115831685920114]
This paper takes a step forward towards closing the gap in decoding complexity by using a shallow or even linear decoding transform resembling that of JPEG.
We exploit the often asymmetrical budget between encoding and decoding, by adopting more powerful encoder networks and iterative encoding.
arXiv Detail & Related papers (2023-04-13T03:38:56Z) - Split Hierarchical Variational Compression [21.474095984110622]
Variational autoencoders (VAEs) have witnessed great success in performing the compression of image datasets.
SHVC introduces an efficient autoregressive sub-pixel convolution, that allows a generalisation between per-pixel autoregressions and fully factorised probability models.
arXiv Detail & Related papers (2022-04-05T09:13:38Z) - Learned Multi-Resolution Variable-Rate Image Compression with
Octave-based Residual Blocks [15.308823742699039]
We propose a new variable-rate image compression framework, which employs generalized octave convolutions (GoConv) and generalized octave transposed-convolutions (GoTConv)
To enable a single model to operate with different bit rates and to learn multi-rate image features, a new objective function is introduced.
Experimental results show that the proposed framework trained with variable-rate objective function outperforms the standard codecs such as H.265/HEVC-based BPG and state-of-the-art learning-based variable-rate methods.
arXiv Detail & Related papers (2020-12-31T06:26:56Z) - How to Exploit the Transferability of Learned Image Compression to
Conventional Codecs [25.622863999901874]
We show how learned image coding can be used as a surrogate to optimize an image for encoding.
Our approach can remodel a conventional image to adjust for the MS-SSIM distortion with over 20% rate improvement without any decoding overhead.
arXiv Detail & Related papers (2020-12-03T12:34:51Z) - Conditional Entropy Coding for Efficient Video Compression [82.35389813794372]
We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames.
We first show that a simple architecture modeling the entropy between the image latent codes is as competitive as other neural video compression works and video codecs.
We then propose a novel internal learning extension on top of this architecture that brings an additional 10% savings without trading off decoding speed.
arXiv Detail & Related papers (2020-08-20T20:01:59Z) - Modeling Lost Information in Lossy Image Compression [72.69327382643549]
Lossy image compression is one of the most commonly used operators for digital images.
We propose a novel invertible framework called Invertible Lossy Compression (ILC) to largely mitigate the information loss problem.
arXiv Detail & Related papers (2020-06-22T04:04:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.