An Overview of the JPEG AI Learning-Based Image Coding Standard
- URL: http://arxiv.org/abs/2510.13867v1
- Date: Mon, 13 Oct 2025 08:44:43 GMT
- Title: An Overview of the JPEG AI Learning-Based Image Coding Standard
- Authors: Semih Esenlik, Yaojun Wu, Zhaobin Zhang, Ye-Kui Wang, Kai Zhang, Li Zhang, João Ascenso, Shan Liu,
- Abstract summary: JPEG AI is an emerging learning-based image coding standard developed by Joint Photographic Experts Group.<n>First version of JPEG AI focuses on human vision tasks, demonstrating significant BD-rate reductions compared to existing standards.
- Score: 20.594919652771715
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: JPEG AI is an emerging learning-based image coding standard developed by Joint Photographic Experts Group (JPEG). The scope of the JPEG AI is the creation of a practical learning-based image coding standard offering a single-stream, compact compressed domain representation, targeting both human visualization and machine consumption. Scheduled for completion in early 2025, the first version of JPEG AI focuses on human vision tasks, demonstrating significant BD-rate reductions compared to existing standards, in terms of MS-SSIM, FSIM, VIF, VMAF, PSNR-HVS, IW-SSIM and NLPD quality metrics. Designed to ensure broad interoperability, JPEG AI incorporates various design features to support deployment across diverse devices and applications. This paper provides an overview of the technical features and characteristics of the JPEG AI standard.
Related papers
- Subjective Visual Quality Assessment for High-Fidelity Learning-Based Image Compression [2.296138318128071]
We present a comprehensive subjective visual quality assessment of JPEG AI-compressed images using the JPEG AIC-3 methodology.<n>We reconstructed JND-based quality scales using a unified model based on boosted and plain triplet comparisons.<n>The CVVDP metric achieved the overall highest performance; however, most metrics including CVDP were overly optimistic in predicting the quality of JPEG AI-compressed images.
arXiv Detail & Related papers (2025-04-07T15:16:58Z) - The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine [49.16996486119006]
Deep learning has emerged as a powerful tool in point cloud coding.<n> JPEG has recently finalized the JPEG Pleno Learning-based Point Cloud Coding standard.<n>This paper provides a complete technical description of the JPEG PCC standard.
arXiv Detail & Related papers (2024-09-12T15:20:23Z) - G-Refine: A General Quality Refiner for Text-to-Image Generation [74.16137826891827]
We introduce G-Refine, a general image quality refiner designed to enhance low-quality images without compromising integrity of high-quality ones.
The model is composed of three interconnected modules: a perception quality indicator, an alignment quality indicator, and a general quality enhancement module.
Extensive experimentation reveals that AIGIs after G-Refine outperform in 10+ quality metrics across 4 databases.
arXiv Detail & Related papers (2024-04-29T00:54:38Z) - Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG
Encoder-Decoder [73.48927855855219]
We propose a system that learns to improve the encoding performance by enhancing its internal neural representations on both the encoder and decoder ends.
Experiments demonstrate that our approach successfully improves the rate-distortion performance over JPEG across various quality metrics.
arXiv Detail & Related papers (2022-01-27T20:20:03Z) - Towards Flexible Blind JPEG Artifacts Removal [73.46374658847675]
We propose a flexible blind convolutional neural network, namely FBCNN, that can predict the adjustable quality factor to control the trade-off between artifacts removal and details preservation.
Our proposed FBCNN achieves favorable performance against state-of-the-art methods in terms of both quantitative metrics and visual quality.
arXiv Detail & Related papers (2021-09-29T17:12:10Z) - Improving Cost Learning for JPEG Steganography by Exploiting JPEG Domain
Knowledge [52.880912557105106]
JEC-RL is explicitly designed to tailor the JPEG DCT structure.
It works with the embedding action sampling mechanism under reinforcement learning.
It can achieve good security performance for JPEG images against both advanced feature based and modern CNN based steganalyzers.
arXiv Detail & Related papers (2021-05-09T08:10:41Z) - Learning-based Compression for Material and Texture Recognition [23.668803886355683]
This paper is concerned with learning-based compression schemes whose compressed-domain representations can be utilized to perform visual processing and computer vision tasks directly in the compressed domain.
We adopt the learning-based JPEG-AI framework for performing material and texture recognition using the compressed-domain latent representation at varing bit-rates.
It is also shown that the compressed-domain classification can yield a competitive performance in terms of Top-1 and Top-5 accuracy while using a smaller reduced-complexity classification model.
arXiv Detail & Related papers (2021-04-16T23:16:26Z) - Adapting JPEG XS gains and priorities to tasks and contents [34.99683302788977]
Constant market requirements for a low-complexity image have led to the recent development and standardization of a lightweight image named JPEG XS.
In this work we show that JPEG XS compression can be adapted to a given task and content, such as preserving visual quality on desktop content or maintaining high accuracy in neural network segmentation tasks.
arXiv Detail & Related papers (2020-05-18T14:33:25Z) - Quantization Guided JPEG Artifact Correction [69.04777875711646]
We develop a novel architecture for artifact correction using the JPEG files quantization matrix.
This allows our single model to achieve state-of-the-art performance over models trained for specific quality settings.
arXiv Detail & Related papers (2020-04-17T00:10:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.