Related papers: Orthonormal Convolutions for the Rotation Based Iterative Gaussianization

URL: http://arxiv.org/abs/2206.03860v1
Date: Wed, 8 Jun 2022 12:56:34 GMT
Title: Orthonormal Convolutions for the Rotation Based Iterative Gaussianization
Authors: Valero Laparra, Alexander Hepburn, J. Emmanuel Johnson, Jesús Malo
Abstract summary: This paper elaborates an extension of rotation-based iterative Gaussianization, RBIG, which makes image Gaussianization possible. In images its application has been restricted to small image patches or isolated pixels, because rotation in RBIG is based on principal or independent component analysis. We present the Convolutional RBIG: an extension that alleviates this issue by imposing that the rotation in RBIG is a convolution.
Score: 64.44661342486434
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper we elaborate an extension of rotation-based iterative Gaussianization, RBIG, which makes image Gaussianization possible. Although RBIG has been successfully applied to many tasks, it is limited to medium dimensionality data (on the order of a thousand dimensions). In images its application has been restricted to small image patches or isolated pixels, because rotation in RBIG is based on principal or independent component analysis and these transformations are difficult to learn and scale. Here we present the Convolutional RBIG: an extension that alleviates this issue by imposing that the rotation in RBIG is a convolution. We propose to learn convolutional rotations (i.e. orthonormal convolutions) by optimising for the reconstruction loss between the input and an approximate inverse of the transformation using the transposed convolution operation. Additionally, we suggest different regularizers in learning these orthonormal convolutions. For example, imposing sparsity in the activations leads to a transformation that extends convolutional independent component analysis to multilayer architectures. We also highlight how statistical properties of the data, such as multivariate mutual information, can be obtained from Convolutional RBIG. We illustrate the behavior of the transform with a simple example of texture synthesis, and analyze its properties by visualizing the stimuli that maximize the response in a certain feature and layer.
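The reconstruction objective mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes 1D signals, non-overlapping filters (stride equal to filter length), and uses the Haar pair as a known example of an orthonormal filter bank. The function names (`strided_conv`, `transposed_conv`, `reconstruction_loss`) and the `skewed` filter bank are hypothetical, chosen only for illustration. The sketch evaluates the loss between the input and its transposed-convolution reconstruction; it does not perform any learning.

```python
import math

def strided_conv(x, filters, stride):
    """Valid 1D cross-correlation of x with each filter at the given stride.
    Returns one list of responses per filter (one list per output channel)."""
    k = len(filters[0])
    positions = range(0, len(x) - k + 1, stride)
    return [[sum(f[j] * x[p + j] for j in range(k)) for p in positions]
            for f in filters]

def transposed_conv(responses, filters, stride, length):
    """Transposed convolution: scatter each response back through its filter.
    For an orthonormal filter bank this acts as the inverse of strided_conv."""
    k = len(filters[0])
    out = [0.0] * length
    for f, channel in zip(filters, responses):
        for i, value in enumerate(channel):
            for j in range(k):
                out[i * stride + j] += value * f[j]
    return out

def reconstruction_loss(x, filters, stride):
    """Squared error between x and transposed_conv(strided_conv(x))."""
    y = strided_conv(x, filters, stride)
    x_hat = transposed_conv(y, filters, stride, len(x))
    return sum((a - b) ** 2 for a, b in zip(x, x_hat))

s = 1.0 / math.sqrt(2.0)
haar = [[s, s], [s, -s]]              # orthonormal bank (assumed example)
skewed = [[1.0, 0.5], [0.2, -1.0]]    # hypothetical non-orthonormal bank
x = [1.0, 3.0, -2.0, 0.5, 4.0, 4.0, -1.0, 2.0]

loss_haar = reconstruction_loss(x, haar, stride=2)
loss_skewed = reconstruction_loss(x, skewed, stride=2)

# A rotation preserves energy: compare squared norms before and after.
energy_in = sum(v * v for v in x)
energy_out = sum(v * v for ch in strided_conv(x, haar, 2) for v in ch)

print("Haar loss:", loss_haar)
print("Skewed loss:", loss_skewed)
print("Energy in/out:", energy_in, energy_out)
```

With the Haar bank the loss vanishes up to floating-point error and the signal energy is preserved, which is what makes the convolution behave as a rotation. With the non-orthonormal bank the loss is strictly positive, and this is the quantity a learning procedure of the kind described in the abstract would drive toward zero.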

Related papers

Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks [0.0]
We show that the same equivariance can be achieved using a simple form of convolution kernels. We implement equivariant neural networks using standard convolution modules.
arXiv Detail & Related papers (2025-05-27T20:27:00Z)
GMR-Conv: An Efficient Rotation and Reflection Equivariant Convolution Kernel Using Gaussian Mixture Rings [4.365790707793824]
We introduce an efficient convolution kernel that smooths radial symmetry using a mixture of Gaussian-weighted rings. Experiments on eight classification and one segmentation datasets demonstrate that GMR-Conv not only matches conventional CNNs' performance but can also surpass it in applications with orientation-less data.
arXiv Detail & Related papers (2025-04-03T17:58:18Z)
Variable-size Symmetry-based Graph Fourier Transforms for image compression [65.7352685872625]
We introduce a new family of Symmetry-based Graph Fourier Transforms (SBGFTs) of variable sizes into a coding framework. Our proposed algorithm generates symmetric graphs on the grid by adding specific symmetrical connections between nodes. Experiments show that SBGFTs outperform the primary transforms integrated in the explicit Multiple Transform Selection.
arXiv Detail & Related papers (2024-11-24T13:00:44Z)
RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification [59.5042031913258]
Non-linear modality discrepancy mainly comes from diverse linear transformations acting on the surface of different materials. We propose a Random Linear Enhancement (RLE) strategy which includes Moderate Random Linear Enhancement (MRLE) and Radical Random Linear Enhancement (RRLE). The experimental results not only demonstrate the superiority and effectiveness of RLE but also confirm its great potential as a general-purpose data augmentation for cross-spectral re-identification.
arXiv Detail & Related papers (2024-11-02T12:13:37Z)
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views [116.10577967146762]
PixelGaussian is an efficient framework for learning generalizable 3D Gaussian reconstruction from arbitrary views. Our method achieves state-of-the-art performance with good generalization to various numbers of views.
arXiv Detail & Related papers (2024-10-24T17:59:58Z)
MGAug: Multimodal Geometric Augmentation in Latent Spaces of Image Deformations [2.711740183729759]
We propose a novel model that generates augmenting transformations in a multimodal latent space of geometric deformations. Experimental results show that our proposed approach outperforms all baselines, with significantly improved prediction accuracy.
arXiv Detail & Related papers (2023-12-20T21:30:55Z)
Dual-Octave Convolution for Accelerated Parallel MR Image Reconstruction [75.35200719645283]
We propose the Dual-Octave Convolution (Dual-OctConv), which is capable of learning multi-scale spatial-frequency features from both real and imaginary components. By reformulating the complex operations using octave convolutions, our model shows a strong ability to capture richer representations of MR images.
arXiv Detail & Related papers (2021-04-12T10:51:05Z)
The Geometry of Deep Generative Image Models and its Applications [0.0]
Generative adversarial networks (GANs) have emerged as a powerful unsupervised method to model the statistical patterns of real-world data sets. These networks are trained to map random inputs in their latent space to new samples representative of the learned data. The structure of the latent space is hard to intuit due to its high dimensionality and the non-linearity of the generator.
arXiv Detail & Related papers (2021-01-15T07:57:33Z)
Learnable Gabor modulated complex-valued networks for orientation robustness [4.024850952459758]
Learnable Gabor Convolutional Networks (LGCNs) are parameter-efficient and offer increased model complexity. We investigate the robustness of complex valued convolutional weights with learned Gabor filters to enable orientation transformations.
arXiv Detail & Related papers (2020-11-23T21:22:27Z)
Rotated Ring, Radial and Depth Wise Separable Radial Convolutions [13.481518628796692]
In this work, we address trainable rotation invariant convolutions and the construction of nets. On the one hand, we show that our approach is rotationally invariant for different models and on different public data sets. The rotationally adaptive convolution models presented are more computationally intensive than normal convolution models.
arXiv Detail & Related papers (2020-10-02T09:01:51Z)
Deeply Learned Spectral Total Variation Decomposition [8.679020335206753]
We present a neural network approximation of a non-linear spectral decomposition. We report up to four orders of magnitude ($\times 10{,}000$) speedup in processing of mega-pixel size images.
arXiv Detail & Related papers (2020-06-17T17:10:43Z)
Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data [52.78581260260455]
We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group. We apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems.
arXiv Detail & Related papers (2020-02-25T17:40:38Z)

This list is automatically generated from the titles and abstracts of the papers on this site.