aiWave: Volumetric Image Compression with 3-D Trained Affine
Wavelet-like Transform
- URL: http://arxiv.org/abs/2203.05822v1
- Date: Fri, 11 Mar 2022 10:02:01 GMT
- Title: aiWave: Volumetric Image Compression with 3-D Trained Affine
Wavelet-like Transform
- Authors: Dongmei Xue, Haichuan Ma, Li Li, Dong Liu, Zhiwei Xiong
- Abstract summary: Most commonly used volumetric image compression methods are based on wavelet transform, such as JP3D.
In this paper, we first design a 3-D trained wavelet-like transform to enable signal-dependent and non-separable transform.
Then, an affine wavelet basis is introduced to capture the various local correlations in different regions of volumetric images.
- Score: 43.984890290691695
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Volumetric image compression has become an urgent task to effectively
transmit and store images produced in biological research and clinical
practice. At present, the most commonly used volumetric image compression
methods are based on wavelet transform, such as JP3D. However, JP3D employs an
ideal, separable, global, and fixed wavelet basis to convert input images from
pixel domain to frequency domain, which seriously limits its performance. In
this paper, we first design a 3-D trained wavelet-like transform to enable
signal-dependent and non-separable transform. Then, an affine wavelet basis is
introduced to capture the various local correlations in different regions of
volumetric images. Furthermore, we embed the proposed wavelet-like transform to
an end-to-end compression framework called aiWave to enable an adaptive
compression scheme for various datasets. Last but not least, we introduce the
weight sharing strategies of the affine wavelet-like transform according to the
volumetric data characteristics in the axial direction to reduce the amount of
parameters. The experimental results show that: 1) when cooperating our trained
3-D affine wavelet-like transform with a simple factorized entropy module,
aiWave performs better than JP3D and is comparable in terms of encoding and
decoding complexities; 2) when adding a context module to further remove signal
redundancy, aiWave can achieve a much better performance than HEVC.
Related papers
- The Empirical Watershed Wavelet [0.0]
In this paper, we provide theoretical results that permits us to build 2D empirical wavelet filters based on an arbitrary partitioning of the frequency domain.
We also propose an algorithm to detect such partitioning from an image spectrum by combining a scale-space representation to estimate the position of dominant harmonic modes and a watershed transform.
arXiv Detail & Related papers (2024-10-24T22:36:43Z) - WiNet: Wavelet-based Incremental Learning for Efficient Medical Image Registration [68.25711405944239]
Deep image registration has demonstrated exceptional accuracy and fast inference.
Recent advances have adopted either multiple cascades or pyramid architectures to estimate dense deformation fields in a coarse-to-fine manner.
We introduce a model-driven WiNet that incrementally estimates scale-wise wavelet coefficients for the displacement/velocity field across various scales.
arXiv Detail & Related papers (2024-07-18T11:51:01Z) - Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection [53.842568573251214]
Experimental results on three SAR datasets demonstrate that our WBANet significantly outperforms contemporary state-of-the-art methods.
Our WBANet achieves 98.33%, 96.65%, and 96.62% of percentage of correct classification (PCC) on the respective datasets.
arXiv Detail & Related papers (2024-07-18T04:36:10Z) - Wavelet-Like Transform-Based Technology in Response to the Call for
Proposals on Neural Network-Based Image Coding [18.1150260268062]
This paper introduces a novel wavelet-like transform-based end-to-end image coding framework -- iWaveV3.
iWaveV3 incorporates many new features such as affine wavelet-like transform, perceptual-friendly quality metric, and more advanced training and online optimization strategies.
iWaveV3 is adopted as a candidate scheme for developing the IEEE Standard for neural-network-based image coding.
arXiv Detail & Related papers (2024-03-09T15:13:49Z) - Misalignment-Robust Frequency Distribution Loss for Image Transformation [51.0462138717502]
This paper aims to address a common challenge in deep learning-based image transformation methods, such as image enhancement and super-resolution.
We introduce a novel and simple Frequency Distribution Loss (FDL) for computing distribution distance within the frequency domain.
Our method is empirically proven effective as a training constraint due to the thoughtful utilization of global information in the frequency domain.
arXiv Detail & Related papers (2024-02-28T09:27:41Z) - Neural Wavelet-domain Diffusion for 3D Shape Generation, Inversion, and
Manipulation [54.09274684734721]
We present a new approach for 3D shape generation, inversion, and manipulation, through a direct generative modeling on a continuous implicit representation in wavelet domain.
Specifically, we propose a compact wavelet representation with a pair of coarse and detail coefficient volumes to implicitly represent 3D shapes via truncated signed distance functions and multi-scale biorthogonal wavelets.
We may jointly train an encoder network to learn a latent space for inverting shapes, allowing us to enable a rich variety of whole-shape and region-aware shape manipulations.
arXiv Detail & Related papers (2023-02-01T02:47:53Z) - Wave-ViT: Unifying Wavelet and Transformers for Visual Representation
Learning [138.29273453811945]
Multi-scale Vision Transformer (ViT) has emerged as a powerful backbone for computer vision tasks.
We propose a new Wavelet Vision Transformer (textbfWave-ViT) that formulates the invertible down-sampling with wavelet transforms and self-attention learning.
arXiv Detail & Related papers (2022-07-11T16:03:51Z) - GHM Wavelet Transform for Deep Image Super Resolution [4.522973196613816]
The GHM multi-level discrete wavelet transform is proposed as preprocessing for image super resolution with convolutional neural networks.
37 single-level wavelets are experimentally analyzed from Haar, Daubechies, Biorthogonal, Reverse Biorthogonal, Coiflets, and Symlets wavelet families.
arXiv Detail & Related papers (2022-04-16T19:59:48Z) - iWave3D: End-to-end Brain Image Compression with Trainable 3-D Wavelet
Transform [42.14812529290784]
We propose a trainable 3-D wavelet transform based on the lifting scheme, in which the predict and update steps are replaced by 3-D convolutional neural networks.
Experimental results demonstrate that our method outperforms JP3D significantly by 2.012 dB in terms of average BD-PSNR.
arXiv Detail & Related papers (2021-09-18T14:38:59Z) - Wavelet Channel Attention Module with a Fusion Network for Single Image
Deraining [46.62290347397139]
Single image deraining is a crucial problem because rain severely degenerates the visibility of images.
We propose the new convolutional neural network (CNN) called the wavelet channel attention module with a fusion network.
arXiv Detail & Related papers (2020-07-17T18:06:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.