Semi-supervised learning of images with strong rotational disorder:
assembling nanoparticle libraries
- URL: http://arxiv.org/abs/2105.11475v1
- Date: Mon, 24 May 2021 18:01:57 GMT
- Title: Semi-supervised learning of images with strong rotational disorder:
assembling nanoparticle libraries
- Authors: Maxim Ziatdinov, Muammer Yusuf Yaman, Yongtao Liu, David Ginger, and
Sergei V. Kalinin
- Abstract summary: In most cases, experimental data streams contain images having arbitrary rotations and translations within the image.
We develop an approach that allows generalizing from a small subset of labeled data to a large unlabeled dataset.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The proliferation of optical, electron, and scanning probe microscopies gives
rise to large volumes of imaging data of objects as diverse as cells,
bacteria, and pollen, as well as nanoparticles, atoms, and molecules. In most cases, the
experimental data streams contain images having arbitrary rotations and
translations within the image. At the same time, in many cases small amounts
of labeled data are available in the form of prior published results, image
collections and catalogs, or even theoretical models. Here we develop an
approach that generalizes from a small subset of labeled data with
weak orientational disorder to a large unlabeled dataset with much stronger
orientational (and positional) disorder, i.e., it performs classification of
image data given a small number of examples even in the presence of a
distribution shift between the labeled and unlabeled parts. This approach is
based on the semi-supervised rotationally invariant variational autoencoder
(ss-rVAE) model, which consists of an encoder-decoder "block" that learns a
rotationally (and translationally) invariant continuous latent representation
of data and a classifier that encodes data into a finite number of discrete
classes. The classifier part of the trained ss-rVAE inherits the rotational
(and translational) invariances and can be deployed independently of the other
parts of the model. The performance of the ss-rVAE is illustrated using
synthetic data sets with known factors of variation. We further demonstrate its
application for experimental data sets of nanoparticles, creating nanoparticle
libraries and disentangling the representations defining the physical factors
of variation in the data. The code reproducing the results is available at
https://github.com/ziatdinovmax/Semi-Supervised-VAE-nanoparticles.
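  The abstract describes the ss-rVAE as an encoder-decoder block that learns a rotation- and translation-invariant continuous latent code, plus a classifier head over discrete classes that can later be deployed on its own. The snippet below is a minimal, illustrative PyTorch sketch of that idea, not the authors' implementation (see the linked repository for that): the class name, layer sizes, MLP encoder/decoder, and the use of a rotated/shifted coordinate grid in the decoder are all assumptions, and the training objective (ELBO plus a classification term on the labeled subset) is omitted for brevity.

```python
# Minimal sketch of an ss-rVAE-style model (assumed architecture, illustration only).
# The encoder infers a rotation angle, a 2D shift, and a latent code z; the decoder
# reads z together with a coordinate grid rotated/shifted by those nuisance variables,
# so z itself stays rotation- and translation-invariant. A small classifier head maps
# z to discrete particle classes and can be reused independently after training.
import torch
import torch.nn as nn


class SketchSSRVAE(nn.Module):
    def __init__(self, img_size=32, latent_dim=2, num_classes=3):
        super().__init__()
        self.img_size = img_size
        in_dim = img_size * img_size
        # Encoder: image -> [angle, dx, dy, z_mu, z_logvar]
        self.encoder = nn.Sequential(
            nn.Flatten(), nn.Linear(in_dim, 256), nn.Tanh(),
            nn.Linear(256, 3 + 2 * latent_dim),
        )
        # Decoder: (transformed pixel coordinates, z) -> pixel intensity
        self.decoder = nn.Sequential(
            nn.Linear(2 + latent_dim, 256), nn.Tanh(),
            nn.Linear(256, 1), nn.Sigmoid(),
        )
        # Classifier head over the invariant latent code
        self.classifier = nn.Linear(latent_dim, num_classes)
        # Fixed pixel-coordinate grid on [-1, 1] x [-1, 1]
        xs = torch.linspace(-1, 1, img_size)
        grid = torch.stack(torch.meshgrid(xs, xs, indexing="ij"), dim=-1)
        self.register_buffer("grid", grid.reshape(-1, 2))  # (P, 2)

    def forward(self, x):
        h = self.encoder(x)                              # (B, 3 + 2*latent_dim)
        angle, shift = h[:, 0], h[:, 1:3]
        z_mu, z_logvar = h[:, 3:].chunk(2, dim=-1)
        z = z_mu + torch.exp(0.5 * z_logvar) * torch.randn_like(z_mu)
        # Rotate and shift the coordinate grid by the inferred nuisance variables
        cos, sin = torch.cos(angle), torch.sin(angle)
        rot = torch.stack([cos, -sin, sin, cos], dim=-1).view(-1, 2, 2)   # (B, 2, 2)
        coords = self.grid.unsqueeze(0) @ rot.transpose(1, 2) + shift.unsqueeze(1)
        # Broadcast z over all pixels and decode intensities
        z_tiled = z.unsqueeze(1).expand(-1, coords.shape[1], -1)
        recon = self.decoder(torch.cat([coords, z_tiled], dim=-1)).squeeze(-1)
        recon = recon.view(-1, self.img_size, self.img_size)
        return recon, z_mu, z_logvar, self.classifier(z_mu)


# Usage: the classifier consumes only the invariant latent code, so after training
# it can be applied (via the encoder) to unlabeled images with stronger disorder.
model = SketchSSRVAE()
images = torch.rand(8, 32, 32)
recon, z_mu, z_logvar, logits = model(images)
```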
Related papers
- Learning Invariant Molecular Representation in Latent Discrete Space [52.13724532622099]
We propose a new framework for learning molecular representations that exhibit invariance and robustness against distribution shifts.
Our model achieves stronger generalization against state-of-the-art baselines in the presence of various distribution shifts.
arXiv Detail & Related papers (2023-10-22T04:06:44Z)
- Higher Order Gauge Equivariant CNNs on Riemannian Manifolds and Applications [7.322121417864824]
We introduce a higher order generalization of the gauge equivariant convolution, dubbed a gauge equivariant Volterra network (GEVNet).
This allows us to model spatially extended nonlinear interactions within a given field while still maintaining equivariance to global isometries.
In the neuroimaging data experiments, the resulting two-part architecture is used to automatically discriminate between patients with Lewy Body Disease (DLB), Alzheimer's Disease (AD), and Parkinson's Disease (PD) from diffusion magnetic resonance images (dMRI).
arXiv Detail & Related papers (2023-05-26T06:02:31Z)
- VTAE: Variational Transformer Autoencoder with Manifolds Learning [144.0546653941249]
Deep generative models have demonstrated successful applications in learning non-linear data distributions through a number of latent variables.
The nonlinearity of the generator implies that the latent space shows an unsatisfactory projection of the data space, which results in poor representation learning.
We show that geodesics and accurate computation can substantially improve the performance of deep generative models.
arXiv Detail & Related papers (2023-04-03T13:13:19Z)
- Physics and Chemistry from Parsimonious Representations: Image Analysis via Invariant Variational Autoencoders [0.0]
Variational autoencoders (VAEs) are emerging as a powerful paradigm for unsupervised data analysis.
This article summarizes recent developments in VAEs, covering the basic principles and intuition behind them.
arXiv Detail & Related papers (2023-03-30T03:16:27Z)
- Latent Space Diffusion Models of Cryo-EM Structures [6.968705314671148]
We train a diffusion model as an expressive, learnable prior in the cryoDRGN framework.
By learning an accurate model of the data distribution, our method unlocks tools in generative modeling, sampling, and distribution analysis.
arXiv Detail & Related papers (2022-11-25T15:17:10Z)
- Equivariance Allows Handling Multiple Nuisance Variables When Analyzing Pooled Neuroimaging Datasets [53.34152466646884]
In this paper, we show how bringing recent results on equivariant representation learning instantiated on structured spaces together with simple use of classical results on causal inference provides an effective practical solution.
We demonstrate how our model allows dealing with more than one nuisance variable under some assumptions and can enable analysis of pooled scientific datasets in scenarios that would otherwise entail removing a large portion of the samples.
arXiv Detail & Related papers (2022-03-29T04:54:06Z)
- Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data [45.23338389559936]
We propose the variational selective autoencoder (VSAE) to learn representations from partially-observed heterogeneous data.
VSAE learns the latent dependencies in heterogeneous data by modeling the joint distribution of observed data, unobserved data, and the imputation mask.
It results in a unified model for various downstream tasks including data generation and imputation.
arXiv Detail & Related papers (2021-02-25T04:39:13Z)
- Flow-based Generative Models for Learning Manifold to Manifold Mappings [39.60406116984869]
We introduce three kinds of invertible layers for manifold-valued data, which are analogous to their functionality in flow-based generative models.
We show promising results where we can reliably and accurately reconstruct brain images of a field of orientation distribution functions.
arXiv Detail & Related papers (2020-12-18T02:19:18Z)
- Data-Driven Discovery of Molecular Photoswitches with Multioutput Gaussian Processes [51.17758371472664]
Photoswitchable molecules display two or more isomeric forms that may be accessed using light.
We present a data-driven discovery pipeline for molecular photoswitches underpinned by dataset curation and multitask learning.
We validate our proposed approach experimentally by screening a library of commercially available photoswitchable molecules.
arXiv Detail & Related papers (2020-06-28T20:59:03Z)
- Modeling Shared Responses in Neuroimaging Studies through MultiView ICA [94.31804763196116]
Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization.
We propose a novel MultiView Independent Component Analysis model for group studies, where data from each subject are modeled as a linear combination of shared independent sources plus noise.
We demonstrate the usefulness of our approach first on fMRI data, where our model demonstrates improved sensitivity in identifying common sources among subjects.
arXiv Detail & Related papers (2020-06-11T17:29:53Z)
- Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data [52.78581260260455]
We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group.
We apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems.
arXiv Detail & Related papers (2020-02-25T17:40:38Z)