Semantic Segmentation in Learned Compressed Domain
- URL: http://arxiv.org/abs/2209.01355v1
- Date: Sat, 3 Sep 2022 07:59:34 GMT
- Title: Semantic Segmentation in Learned Compressed Domain
- Authors: Jinming Liu and Heming Sun and Jiro Katto
- Abstract summary: We propose a method based on the compressed domain to improve segmentation tasks.
Two different modules are explored and analyzed to help the compressed representation be transformed as the features in the segmentation network.
- Score: 21.53261818914534
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Most machine vision tasks (e.g., semantic segmentation) are based on images
encoded and decoded by image compression algorithms (e.g., JPEG). However,
these decoded images in the pixel domain introduce distortion, and they are
optimized for human perception, making the performance of machine vision tasks
suboptimal. In this paper, we propose a method based on the compressed domain
to improve segmentation tasks. i) A dynamic and a static channel selection
method are proposed to reduce the redundancy of compressed representations that
are obtained by encoding. ii) Two different transform modules are explored and
analyzed to help the compressed representation be transformed as the features
in the segmentation network. The experimental results show that we can save up
to 15.8\% bitrates compared with a state-of-the-art compressed domain-based
work while saving up to about 83.6\% bitrates and 44.8\% inference time
compared with the pixel domain-based method.
Related papers
- Progressive Learning with Visual Prompt Tuning for Variable-Rate Image
Compression [60.689646881479064]
We propose a progressive learning paradigm for transformer-based variable-rate image compression.
Inspired by visual prompt tuning, we use LPM to extract prompts for input images and hidden features at the encoder side and decoder side, respectively.
Our model outperforms all current variable image methods in terms of rate-distortion performance and approaches the state-of-the-art fixed image compression methods trained from scratch.
arXiv Detail & Related papers (2023-11-23T08:29:32Z) - CompaCT: Fractal-Based Heuristic Pixel Segmentation for Lossless Compression of High-Color DICOM Medical Images [0.0]
Medical images require a high color depth of 12 bits per pixel component for accurate analysis by physicians.
Standard-based compression of images via filtering is well-known; however, it remains suboptimal in the medical domain due to non-specialized implementations.
This study proposes a medical image compression algorithm, CompaCT, that aims to target spatial features and patterns of pixel concentration for dynamically enhanced data processing.
arXiv Detail & Related papers (2023-08-24T21:43:04Z) - You Can Mask More For Extremely Low-Bitrate Image Compression [80.7692466922499]
Learned image compression (LIC) methods have experienced significant progress during recent years.
LIC methods fail to explicitly explore the image structure and texture components crucial for image compression.
We present DA-Mask that samples visible patches based on the structure and texture of original images.
We propose a simple yet effective masked compression model (MCM), the first framework that unifies LIC and LIC end-to-end for extremely low-bitrate compression.
arXiv Detail & Related papers (2023-06-27T15:36:22Z) - DNN-Compressed Domain Visual Recognition with Feature Adaptation [19.79803434998116]
Learning-based image compression was shown to achieve a competitive performance with state-of-the-art transform-based codecs.
This motivated the development of new learning-based visual compression standards such as JPEG-AI.
This paper is concerned with learning-based compression schemes whose compressed-domain representations can be utilized to perform visual processing and computer vision tasks directly in the compressed domain.
arXiv Detail & Related papers (2023-05-13T20:45:17Z) - Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG
Encoder-Decoder [73.48927855855219]
We propose a system that learns to improve the encoding performance by enhancing its internal neural representations on both the encoder and decoder ends.
Experiments demonstrate that our approach successfully improves the rate-distortion performance over JPEG across various quality metrics.
arXiv Detail & Related papers (2022-01-27T20:20:03Z) - A New Image Codec Paradigm for Human and Machine Uses [53.48873918537017]
A new scalable image paradigm for both human and machine uses is proposed in this work.
The high-level instance segmentation map and the low-level signal features are extracted with neural networks.
An image is designed and trained to achieve the general-quality image reconstruction with the 16-bit gray-scale profile and signal features.
arXiv Detail & Related papers (2021-12-19T06:17:38Z) - A Novel Upsampling and Context Convolution for Image Semantic
Segmentation [0.966840768820136]
Recent methods for semantic segmentation often employ an encoder-decoder structure using deep convolutional neural networks.
We propose a dense upsampling convolution method based on guided filtering to effectively preserve the spatial information of the image in the network.
We report a new record of 82.86% and 81.62% of pixel accuracy on ADE20K and Pascal-Context benchmark datasets, respectively.
arXiv Detail & Related papers (2021-03-20T06:16:42Z) - End-to-end optimized image compression for multiple machine tasks [3.8323580808203785]
We introduce 'Connectors' that are inserted between the decoder and the task algorithms to enable a direct transformation of the compressed content.
We demonstrate the effectiveness of the proposed method by achieving significant rate-accuracy performance improvement for both image classification and object segmentation.
arXiv Detail & Related papers (2021-03-06T19:09:05Z) - Image Compression with Encoder-Decoder Matched Semantic Segmentation [15.536056887418676]
layered image compression is a promising direction.
Some works transmit the semantic segment together with the compressed image data.
We propose a new layered image compression framework with encoder matched semantic segmentation (EDMS)
The proposed EDMS framework can get up to 35.31% BD-rate reduction over the HEVC-based (BPG) encoding time saving.
arXiv Detail & Related papers (2021-01-24T04:11:05Z) - Modeling Lost Information in Lossy Image Compression [72.69327382643549]
Lossy image compression is one of the most commonly used operators for digital images.
We propose a novel invertible framework called Invertible Lossy Compression (ILC) to largely mitigate the information loss problem.
arXiv Detail & Related papers (2020-06-22T04:04:56Z) - Discernible Image Compression [124.08063151879173]
This paper aims to produce compressed images by pursuing both appearance and perceptual consistency.
Based on the encoder-decoder framework, we propose using a pre-trained CNN to extract features of the original and compressed images.
Experiments on benchmarks demonstrate that images compressed by using the proposed method can also be well recognized by subsequent visual recognition and detection models.
arXiv Detail & Related papers (2020-02-17T07:35:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.