Higher Order Gauge Equivariant CNNs on Riemannian Manifolds and
Applications
- URL: http://arxiv.org/abs/2305.16657v1
- Date: Fri, 26 May 2023 06:02:31 GMT
- Title: Higher Order Gauge Equivariant CNNs on Riemannian Manifolds and
Applications
- Authors: Gianfranco Cortes, Yue Yu, Robin Chen, Melissa Armstrong, David
Vaillancourt, Baba C. Vemuri
- Abstract summary: We introduce a higher order generalization of the gauge equivariant convolution, dubbed a gauge equivariant Volterra network (GEVNet)
This allows us to model spatially extended nonlinear interactions within a given field while still maintaining equivariance to global isometries.
In the neuroimaging data experiments, the resulting two-part architecture is used to automatically discriminate between patients with Lewy Body Disease (DLB), Alzheimer's Disease (AD) and Parkinson's Disease (PD) from diffusion magnetic resonance images (dMRI)
- Score: 7.322121417864824
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the advent of group equivariant convolutions in deep networks
literature, spherical CNNs with $\mathsf{SO}(3)$-equivariant layers have been
developed to cope with data that are samples of signals on the sphere $S^2$.
One can implicitly obtain $\mathsf{SO}(3)$-equivariant convolutions on $S^2$
with significant efficiency gains by explicitly requiring gauge equivariance
w.r.t. $\mathsf{SO}(2)$. In this paper, we build on this fact by introducing a
higher order generalization of the gauge equivariant convolution, whose
implementation is dubbed a gauge equivariant Volterra network (GEVNet). This
allows us to model spatially extended nonlinear interactions within a given
receptive field while still maintaining equivariance to global isometries. We
prove theoretical results regarding the equivariance and construction of higher
order gauge equivariant convolutions. Then, we empirically demonstrate the
parameter efficiency of our model, first on computer vision benchmark data
(e.g. spherical MNIST), and then in combination with a convolutional kernel
network (CKN) on neuroimaging data. In the neuroimaging data experiments, the
resulting two-part architecture (CKN + GEVNet) is used to automatically
discriminate between patients with Lewy Body Disease (DLB), Alzheimer's Disease
(AD) and Parkinson's Disease (PD) from diffusion magnetic resonance images
(dMRI). The GEVNet extracts micro-architectural features within each voxel,
while the CKN extracts macro-architectural features across voxels. This
compound architecture is uniquely poised to exploit the intra- and inter-voxel
information contained in the dMRI data, leading to improved performance over
the classification results obtained from either of the individual components.
Related papers
- Geodesic Optimization for Predictive Shift Adaptation on EEG data [53.58711912565724]
Domain adaptation methods struggle when distribution shifts occur simultaneously in $X$ and $y$.
This paper proposes a novel method termed Geodesic Optimization for Predictive Shift Adaptation (GOPSA) to address test-time multi-source DA.
GOPSA has the potential to combine the advantages of mixed-effects modeling with machine learning for biomedical applications of EEG.
arXiv Detail & Related papers (2024-07-04T12:15:42Z) - Assessing Neural Network Representations During Training Using
Noise-Resilient Diffusion Spectral Entropy [55.014926694758195]
Entropy and mutual information in neural networks provide rich information on the learning process.
We leverage data geometry to access the underlying manifold and reliably compute these information-theoretic measures.
We show that they form noise-resistant measures of intrinsic dimensionality and relationship strength in high-dimensional simulated data.
arXiv Detail & Related papers (2023-12-04T01:32:42Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - High-Dimensional Smoothed Entropy Estimation via Dimensionality
Reduction [14.53979700025531]
We consider the estimation of the differential entropy $h(X+Z)$ via $n$ independently and identically distributed samples of $X$.
Under the absolute-error loss, the above problem has a parametric estimation rate of $fraccDsqrtn$.
We overcome this exponential sample complexity by projecting $X$ to a low-dimensional space via principal component analysis (PCA) before the entropy estimation.
arXiv Detail & Related papers (2023-05-08T13:51:48Z) - Lorentz group equivariant autoencoders [6.858459233149096]
Lorentz group autoencoder (LGAE)
We develop an autoencoder model equivariant with respect to the proper, orthochronous Lorentz group $mathrmSO+(2,1)$, with a latent space living in the representations of the group.
We present our architecture and several experimental results on jets at the LHC and find it outperforms graph and convolutional neural network baseline models on several compression, reconstruction, and anomaly detection metrics.
arXiv Detail & Related papers (2022-12-14T17:19:46Z) - FedLGA: Towards System-Heterogeneity of Federated Learning via Local
Gradient Approximation [21.63719641718363]
We formalize the system-heterogeneous FL problem and propose a new algorithm, called FedLGA, which addresses this problem by bridging the divergence local model updates via epoch approximation.
The results of comprehensive experiments on multiple datasets show that FedLGA outperforms current FL benchmarks against the system-heterogeneity.
arXiv Detail & Related papers (2021-12-22T16:05:09Z) - Inverting brain grey matter models with likelihood-free inference: a
tool for trustable cytoarchitecture measurements [62.997667081978825]
characterisation of the brain grey matter cytoarchitecture with quantitative sensitivity to soma density and volume remains an unsolved challenge in dMRI.
We propose a new forward model, specifically a new system of equations, requiring a few relatively sparse b-shells.
We then apply modern tools from Bayesian analysis known as likelihood-free inference (LFI) to invert our proposed model.
arXiv Detail & Related papers (2021-11-15T09:08:27Z) - VolterraNet: A higher order convolutional network with group
equivariance for homogeneous manifolds [19.39397826006002]
Convolutional neural networks have been highly successful in image-based learning tasks.
Recent work has generalized the traditional convolutional layer of a convolutional neural network to non-Euclidean spaces.
We present a novel higher order Volterra convolutional neural network (VolterraNet) for data defined as samples of functions.
arXiv Detail & Related papers (2021-06-05T19:28:16Z) - Fundamental tradeoffs between memorization and robustness in random
features and neural tangent regimes [15.76663241036412]
We prove for a large class of activation functions that, if the model memorizes even a fraction of the training, then its Sobolev-seminorm is lower-bounded.
Experiments reveal for the first time, (iv) a multiple-descent phenomenon in the robustness of the min-norm interpolator.
arXiv Detail & Related papers (2021-06-04T17:52:50Z) - Brain Image Synthesis with Unsupervised Multivariate Canonical
CSC$\ell_4$Net [122.8907826672382]
We propose to learn dedicated features that cross both intre- and intra-modal variations using a novel CSC$ell_4$Net.
arXiv Detail & Related papers (2021-03-22T05:19:40Z) - Diffusion Earth Mover's Distance and Distribution Embeddings [61.49248071384122]
Diffusion can be computed in $tildeO(n)$ time and is more accurate than similarly fast algorithms such as tree-baseds.
We show Diffusion is fully differentiable, making it amenable to future uses in gradient-descent frameworks such as deep neural networks.
arXiv Detail & Related papers (2021-02-25T13:18:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.