Bit Distribution Study and Implementation of Spatial Quality Map in the
JPEG-AI Standardization
- URL: http://arxiv.org/abs/2402.17470v1
- Date: Tue, 27 Feb 2024 12:52:44 GMT
- Title: Bit Distribution Study and Implementation of Spatial Quality Map in the
JPEG-AI Standardization
- Authors: Panqi Jia, Jue Mao, Esin Koyuncu, A. Burakhan Koyuncu, Timofey
Solovyev, Alexander Karabutov, Yin Zhao, Elena Alshina, Andre Kaup
- Abstract summary: The JPEG-AI verification model has been released and is currently under development for standardization.
We propose a spatial bit allocation method to optimize the JPEG-AI verification model's bit distribution and enhance the visual quality.
- Score: 39.71764233394706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Currently, there is a high demand for neural network-based image compression
codecs. These codecs employ non-linear transforms to create compact bit
representations and facilitate faster coding speeds on devices compared to the
hand-crafted transforms used in classical frameworks. The scientific and
industrial communities are highly interested in these properties, leading to
the standardization effort of JPEG-AI. The JPEG-AI verification model has been
released and is currently under development for standardization. Utilizing
neural networks, it can outperform the classic codec VVC intra by over 10%
BD-rate operating at base operation point. Researchers attribute this success
to the flexible bit distribution in the spatial domain, in contrast to VVC
intra's anchor that is generated with a constant quality point. However, our
study reveals that VVC intra displays a more adaptable bit distribution
structure through the implementation of various block sizes. As a result of our
observations, we have proposed a spatial bit allocation method to optimize the
JPEG-AI verification model's bit distribution and enhance the visual quality.
Furthermore, by applying the VVC bit distribution strategy, the objective
performance of JPEG-AI verification mode can be further improved, resulting in
a maximum gain of 0.45 dB in PSNR-Y.
Related papers
- Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model [45.64945099984195]
The research on neural network (NN) based image compression has shown superior performance compared to classical compression frameworks.
The current state of the JPEG-AI verification model experiences significant slowdowns during bit rate matching.
The proposed methodology offers a fourfold acceleration and over 1% improvement in BD-rate at the base operation point.
arXiv Detail & Related papers (2024-02-27T13:12:18Z) - ConvNeXt-ChARM: ConvNeXt-based Transform for Efficient Neural Image
Compression [18.05997169440533]
We propose ConvNeXt-ChARM, an efficient ConvNeXt-based transform coding framework, paired with a compute-efficient channel-wise auto-regressive auto-regressive.
We show that ConvNeXt-ChARM brings consistent and significant BD-rate (PSNR) reductions estimated on average to 5.24% and 1.22% over the versatile video coding (VVC) reference encoder (VTM-18.0) and the state-of-the-art learned image compression method SwinT-ChARM.
arXiv Detail & Related papers (2023-07-12T11:45:54Z) - Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient
Neural Image Compression [11.25130799452367]
We propose an absolute image compression transformer (ICT) for neural image compression (NIC)
ICT captures both global and local contexts from the latent representations and better parameterize the distribution of the quantized latents.
Our framework significantly improves the trade-off between coding efficiency and decoder complexity over the versatile video coding (VVC) reference encoder (VTM-18.0) and the neural SwinT-ChARM.
arXiv Detail & Related papers (2023-07-05T13:17:14Z) - Modality-Agnostic Variational Compression of Implicit Neural
Representations [96.35492043867104]
We introduce a modality-agnostic neural compression algorithm based on a functional view of data and parameterised as an Implicit Neural Representation (INR)
Bridging the gap between latent coding and sparsity, we obtain compact latent representations non-linearly mapped to a soft gating mechanism.
After obtaining a dataset of such latent representations, we directly optimise the rate/distortion trade-off in a modality-agnostic space using neural compression.
arXiv Detail & Related papers (2023-01-23T15:22:42Z) - Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG
Encoder-Decoder [73.48927855855219]
We propose a system that learns to improve the encoding performance by enhancing its internal neural representations on both the encoder and decoder ends.
Experiments demonstrate that our approach successfully improves the rate-distortion performance over JPEG across various quality metrics.
arXiv Detail & Related papers (2022-01-27T20:20:03Z) - Learned Multi-Resolution Variable-Rate Image Compression with
Octave-based Residual Blocks [15.308823742699039]
We propose a new variable-rate image compression framework, which employs generalized octave convolutions (GoConv) and generalized octave transposed-convolutions (GoTConv)
To enable a single model to operate with different bit rates and to learn multi-rate image features, a new objective function is introduced.
Experimental results show that the proposed framework trained with variable-rate objective function outperforms the standard codecs such as H.265/HEVC-based BPG and state-of-the-art learning-based variable-rate methods.
arXiv Detail & Related papers (2020-12-31T06:26:56Z) - Learning to Improve Image Compression without Changing the Standard
Decoder [100.32492297717056]
We propose learning to improve the encoding performance with the standard decoder.
Specifically, a frequency-domain pre-editing method is proposed to optimize the distribution of DCT coefficients.
We do not modify the JPEG decoder and therefore our approach is applicable when viewing images with the widely used standard JPEG decoder.
arXiv Detail & Related papers (2020-09-27T19:24:42Z) - Quantization Guided JPEG Artifact Correction [69.04777875711646]
We develop a novel architecture for artifact correction using the JPEG files quantization matrix.
This allows our single model to achieve state-of-the-art performance over models trained for specific quality settings.
arXiv Detail & Related papers (2020-04-17T00:10:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.