Generative structured normalizing flow Gaussian processes applied to
spectroscopic data
- URL: http://arxiv.org/abs/2212.07554v1
- Date: Wed, 14 Dec 2022 23:57:46 GMT
- Title: Generative structured normalizing flow Gaussian processes applied to
spectroscopic data
- Authors: Natalie Klein, Nishant Panda, Patrick Gasda, Diane Oyen
- Abstract summary: In the physical sciences, limited training data may not adequately characterize future observed data.
It is critical that models adequately indicate uncertainty, particularly when they may be asked to extrapolate.
We demonstrate the methodology on laser-induced breakdown spectroscopy data from the ChemCam instrument onboard the Mars rover Curiosity.
- Score: 4.0773490083614075
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we propose a novel generative model for mapping inputs to
structured, high-dimensional outputs using structured conditional normalizing
flows and Gaussian process regression. The model is motivated by the need to
characterize uncertainty in the input/output relationship when making
inferences on new data. In particular, in the physical sciences, limited
training data may not adequately characterize future observed data; it is
critical that models adequately indicate uncertainty, particularly when they
may be asked to extrapolate. In our proposed model, structured conditional
normalizing flows provide parsimonious latent representations that relate to
the inputs through a Gaussian process, providing exact likelihood calculations
and uncertainty that naturally increases away from the training data inputs. We
demonstrate the methodology on laser-induced breakdown spectroscopy data from
the ChemCam instrument onboard the Mars rover Curiosity. ChemCam was designed
to recover the chemical composition of rock and soil samples by measuring the
spectral properties of plasma atomic emissions induced by a laser pulse. We
show that our model can generate realistic spectra conditional on a given
chemical composition and that we can use the model to perform uncertainty
quantification of chemical compositions for new observed spectra. Based on our
results, we anticipate that our proposed modeling approach may be useful in
other scientific domains with high-dimensional, complex structure where it is
important to quantify predictive uncertainty.
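The abstract's claim that Gaussian process uncertainty "naturally increases away from the training data inputs" can be illustrated with a minimal sketch. This is plain GP regression in NumPy, not the authors' structured normalizing flow model. The squared-exponential kernel, the noise level, the toy sine data, and the helper names `rbf_kernel` and `gp_posterior` are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel between 1-D input arrays a and b (assumed kernel)."""
    sq_dist = (a[:, None] - b[None, :]) ** 2
    return variance * np.exp(-0.5 * sq_dist / lengthscale**2)

def gp_posterior(x_train, y_train, x_test, noise=1e-2):
    """Posterior mean and latent variance of a zero-mean GP at x_test."""
    # Training covariance with observation noise on the diagonal.
    K = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    K_s = rbf_kernel(x_train, x_test)
    # Cholesky factorization for a numerically stable solve.
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = K_s.T @ alpha
    v = np.linalg.solve(L, K_s)
    var = np.diag(rbf_kernel(x_test, x_test)) - np.sum(v**2, axis=0)
    return mean, var

# Toy data (hypothetical): three training inputs clustered around zero.
x_train = np.array([-1.0, 0.0, 1.0])
y_train = np.sin(x_train)

# Test inputs at increasing distance from the training data.
x_test = np.array([0.0, 2.0, 5.0])
mean, var = gp_posterior(x_train, y_train, x_test)

for x, m, s2 in zip(x_test, mean, var):
    print(f"x = {x:4.1f}  mean = {m:7.4f}  variance = {s2:.4f}")
```

In this sketch the predictive variance is smallest at the test input that coincides with a training point and approaches the prior variance as the test input moves far from the training data, which is the extrapolation behavior the abstract relies on.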
Related papers
- Balancing Molecular Information and Empirical Data in the Prediction of Physico-Chemical Properties [8.649679686652648]
We propose a general method for combining molecular descriptors with representation learning.
The proposed hybrid model exploits chemical structure information using graph neural networks.
It automatically detects cases where structure-based predictions are unreliable, in which case it corrects them by representation-learning based predictions.
arXiv Detail & Related papers (2024-06-12T10:51:00Z)
- Enhanced sampling of robust molecular datasets with uncertainty-based collective variables [0.0]
We propose a method that leverages uncertainty as the collective variable (CV) to guide the acquisition of chemically-relevant data points.
This approach employs a Gaussian Mixture Model-based uncertainty metric from a single model as the CV for biased molecular dynamics simulations.
arXiv Detail & Related papers (2024-02-06T06:42:51Z)
- Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation (UniMat) that can represent any crystal structure.
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z)
- Deep-learning-based prediction of nanoparticle phase transitions during in situ transmission electron microscopy [3.613625739845355]
We train deep learning models to predict a sequence of future video frames based on the input of a sequence of previous frames.
This capability provides insight into size-dependent structural changes in Au nanoparticles under dynamic reaction conditions.
It may be possible to anticipate the next steps of a chemical reaction for emerging automated experimentation platforms.
arXiv Detail & Related papers (2022-05-23T15:50:24Z)
- Analytical Modelling of Exoplanet Transit Specroscopy with Dimensional Analysis and Symbolic Regression [68.8204255655161]
The deep learning revolution has opened the door for deriving such analytical results directly with a computer algorithm fitting to the data.
We successfully demonstrate the use of symbolic regression on synthetic data for the transit radii of generic hot Jupiter exoplanets.
As a preprocessing step, we use dimensional analysis to identify the relevant dimensionless combinations of variables.
arXiv Detail & Related papers (2021-12-22T00:52:56Z)
- Prediction of liquid fuel properties using machine learning models with Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can accurately predict fuel properties over a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z)
- Learning Neural Generative Dynamics for Molecular Conformation Generation [89.03173504444415]
We study how to generate molecule conformations (i.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z)
- Leveraging Global Parameters for Flow-based Neural Posterior Estimation [90.21090932619695]
Inferring the parameters of a model based on experimental observations is central to the scientific method.
A particularly challenging setting is when the model is strongly indeterminate, i.e., when distinct sets of parameters yield identical observations.
We present a method for cracking such indeterminacy by exploiting additional information conveyed by an auxiliary set of observations sharing global parameters.
arXiv Detail & Related papers (2021-02-12T12:23:13Z)
- Goal-directed Generation of Discrete Structures with Conditional Generative Models [85.51463588099556]
We introduce a novel approach to directly optimize a reinforcement learning objective, maximizing an expected reward.
We test our methodology on two tasks: generating molecules with user-defined properties and identifying short Python expressions that evaluate to a given target value.
arXiv Detail & Related papers (2020-10-05T20:03:13Z)
- Physics-Constrained Predictive Molecular Latent Space Discovery with Graph Scattering Variational Autoencoder [0.0]
We develop a molecular generative model based on variational inference and graph theory in the small data regime.
The model's performance is evaluated by generating molecules with desired target properties.
arXiv Detail & Related papers (2020-09-29T09:05:27Z)
- Embedded-physics machine learning for coarse-graining and collective variable discovery without data [3.222802562733787]
We present a novel learning framework that consistently embeds underlying physics.
We propose a novel objective based on reverse Kullback-Leibler divergence that fully incorporates the available physics in the form of the atomistic force field.
We demonstrate the algorithmic advances in terms of predictive ability and the physical meaning of the revealed CVs for a bimodal potential energy function and the alanine dipeptide.
arXiv Detail & Related papers (2020-02-24T10:28:41Z)