Slimmable Compressive Autoencoders for Practical Neural Image
Compression
- URL: http://arxiv.org/abs/2103.15726v1
- Date: Mon, 29 Mar 2021 16:12:04 GMT
- Title: Slimmable Compressive Autoencoders for Practical Neural Image
Compression
- Authors: Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov
- Abstract summary: We propose slimmable compressive autoencoders (SlimCAEs) for practical image compression.
SlimCAEs are highly flexible models that provide excellent rate-distortion performance, variable rate, and dynamic adjustment of memory, computational cost and latency.
- Score: 20.715312224456138
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Neural image compression leverages deep neural networks to outperform
traditional image codecs in rate-distortion performance. However, the resulting
models are also heavy, computationally demanding and generally optimized for a
single rate, limiting their practical use. Focusing on practical image
compression, we propose slimmable compressive autoencoders (SlimCAEs), where
rate (R) and distortion (D) are jointly optimized for different capacities.
Once trained, encoders and decoders can be executed at different capacities,
leading to different rates and complexities. We show that a successful
implementation of SlimCAEs requires suitable capacity-specific RD tradeoffs.
Our experiments show that SlimCAEs are highly flexible models that provide
excellent rate-distortion performance, variable rate, and dynamic adjustment of
memory, computational cost and latency, thus addressing the main requirements
of practical image compression.
Related papers
- Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption [57.056311855630916]
We propose a Controllable Generative Image Compression framework, Control-GIC.
It is capable of fine-grained adaption across a broad spectrum while ensuring high-fidelity and generality compression.
We develop a conditional conditionalization that can trace back to historic encoded multi-granularity representations.
arXiv Detail & Related papers (2024-06-02T14:22:09Z) - Enhancing the Rate-Distortion-Perception Flexibility of Learned Image
Codecs with Conditional Diffusion Decoders [7.485128109817576]
We show that conditional diffusion models can lead to promising results in the generative compression task when used as a decoder.
In this paper, we show that conditional diffusion models can lead to promising results in the generative compression task when used as a decoder.
arXiv Detail & Related papers (2024-03-05T11:48:35Z) - Exploring the Rate-Distortion-Complexity Optimization in Neural Image
Compression [26.1947289647201]
We study the rate-distortion-complexity (RDC) optimization in neural image compression.
By quantifying the decoding complexity as a factor in the optimization goal, we are now able to precisely control the RDC trade-off.
A variable-complexity neural is designed to leverage the spatial dependencies adaptively according to industrial demands.
arXiv Detail & Related papers (2023-05-12T03:56:25Z) - High-Fidelity Variable-Rate Image Compression via Invertible Activation
Transformation [24.379052026260034]
We propose the Invertible Activation Transformation (IAT) module to tackle the issue of high-fidelity fine variable-rate image compression.
IAT and QLevel together give the image compression model the ability of fine variable-rate control while better maintaining the image fidelity.
Our method outperforms the state-of-the-art variable-rate image compression method by a large margin, especially after multiple re-encodings.
arXiv Detail & Related papers (2022-09-12T07:14:07Z) - Cross Modal Compression: Towards Human-comprehensible Semantic
Compression [73.89616626853913]
Cross modal compression is a semantic compression framework for visual data.
We show that our proposed CMC can achieve encouraging reconstructed results with an ultrahigh compression ratio.
arXiv Detail & Related papers (2022-09-06T15:31:11Z) - Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG
Encoder-Decoder [73.48927855855219]
We propose a system that learns to improve the encoding performance by enhancing its internal neural representations on both the encoder and decoder ends.
Experiments demonstrate that our approach successfully improves the rate-distortion performance over JPEG across various quality metrics.
arXiv Detail & Related papers (2022-01-27T20:20:03Z) - Modeling Image Quantization Tradeoffs for Optimal Compression [0.0]
Lossy compression algorithms target tradeoffs by quantizating high frequency data to increase compression rates.
We propose a new method of optimizing quantization tables using Deep Learning and a minimax loss function.
arXiv Detail & Related papers (2021-12-14T07:35:22Z) - Implicit Neural Representations for Image Compression [103.78615661013623]
Implicit Neural Representations (INRs) have gained attention as a novel and effective representation for various data types.
We propose the first comprehensive compression pipeline based on INRs including quantization, quantization-aware retraining and entropy coding.
We find that our approach to source compression with INRs vastly outperforms similar prior work.
arXiv Detail & Related papers (2021-12-08T13:02:53Z) - Substitutional Neural Image Compression [48.20906717052056]
Substitutional Neural Image Compression (SNIC) is a general approach for enhancing any neural image compression model.
It boosts compression performance toward a flexible distortion metric and enables bit-rate control using a single model instance.
arXiv Detail & Related papers (2021-05-16T20:53:31Z) - Quantization Guided JPEG Artifact Correction [69.04777875711646]
We develop a novel architecture for artifact correction using the JPEG files quantization matrix.
This allows our single model to achieve state-of-the-art performance over models trained for specific quality settings.
arXiv Detail & Related papers (2020-04-17T00:10:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.