Generative structured normalizing flow Gaussian processes applied to
spectroscopic data
- URL: http://arxiv.org/abs/2212.07554v1
- Date: Wed, 14 Dec 2022 23:57:46 GMT
- Title: Generative structured normalizing flow Gaussian processes applied to
spectroscopic data
- Authors: Natalie Klein, Nishant Panda, Patrick Gasda, Diane Oyen
- Abstract summary: In the physical sciences, limited training data may not adequately characterize future observed data.
It is critical that models adequately indicate uncertainty, particularly when they may be asked to extrapolate.
We demonstrate the methodology on laser-induced breakdown spectroscopy data from the ChemCam instrument onboard the Mars rover Curiosity.
- Score: 4.0773490083614075
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we propose a novel generative model for mapping inputs to
structured, high-dimensional outputs using structured conditional normalizing
flows and Gaussian process regression. The model is motivated by the need to
characterize uncertainty in the input/output relationship when making
inferences on new data. In particular, in the physical sciences, limited
training data may not adequately characterize future observed data; it is
critical that models adequately indicate uncertainty, particularly when they
may be asked to extrapolate. In our proposed model, structured conditional
normalizing flows provide parsimonious latent representations that relate to
the inputs through a Gaussian process, providing exact likelihood calculations
and uncertainty that naturally increases away from the training data inputs. We
demonstrate the methodology on laser-induced breakdown spectroscopy data from
the ChemCam instrument onboard the Mars rover Curiosity. ChemCam was designed
to recover the chemical composition of rock and soil samples by measuring the
spectral properties of plasma atomic emissions induced by a laser pulse. We
show that our model can generate realistic spectra conditional on a given
chemical composition and that we can use the model to perform uncertainty
quantification of chemical compositions for new observed spectra. Based on our
results, we anticipate that our proposed modeling approach may be useful in
other scientific domains with high-dimensional, complex structure where it is
important to quantify predictive uncertainty.
Related papers
- Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - Targeting the partition function of chemically disordered materials with a generative approach based on inverse variational autoencoders [0.0]
We propose a novel approach where generative machine learning is used to yield a representative set of configurations for accurate property evaluation.
Our method employs a specific type of variational autoencoder with inverse roles for the encoder and decoder.
We illustrate our approach by computing point-defect formation energies and concentrations in (U, Pu)O2 mixed-oxide fuels.
arXiv Detail & Related papers (2024-08-27T10:05:37Z) - Balancing Molecular Information and Empirical Data in the Prediction of Physico-Chemical Properties [8.649679686652648]
We propose a general method for combining molecular descriptors with representation learning.
The proposed hybrid model exploits chemical structure information using graph neural networks.
It automatically detects cases where structure-based predictions are unreliable, in which case it corrects them by representation-learning based predictions.
arXiv Detail & Related papers (2024-06-12T10:51:00Z) - Enhanced sampling of robust molecular datasets with uncertainty-based
collective variables [0.0]
We propose a method that leverages uncertainty as the collective variable (CV) to guide the acquisition of chemically-relevant data points.
This approach employs a Gaussian Mixture Model-based uncertainty metric from a single model as the CV for biased molecular dynamics simulations.
arXiv Detail & Related papers (2024-02-06T06:42:51Z) - Scalable Diffusion for Materials Generation [99.71001883652211]
We develop a unified crystal representation that can represent any crystal structure (UniMat)
UniMat can generate high fidelity crystal structures from larger and more complex chemical systems.
We propose additional metrics for evaluating generative models of materials.
arXiv Detail & Related papers (2023-10-18T15:49:39Z) - Analytical Modelling of Exoplanet Transit Specroscopy with Dimensional
Analysis and Symbolic Regression [68.8204255655161]
The deep learning revolution has opened the door for deriving such analytical results directly with a computer algorithm fitting to the data.
We successfully demonstrate the use of symbolic regression on synthetic data for the transit radii of generic hot Jupiter exoplanets.
As a preprocessing step, we use dimensional analysis to identify the relevant dimensionless combinations of variables.
arXiv Detail & Related papers (2021-12-22T00:52:56Z) - Prediction of liquid fuel properties using machine learning models with
Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can predict accurately the fuel properties of a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z) - Learning Neural Generative Dynamics for Molecular Conformation
Generation [89.03173504444415]
We study how to generate molecule conformations (textiti.e., 3D structures) from a molecular graph.
We propose a novel probabilistic framework to generate valid and diverse conformations given a molecular graph.
arXiv Detail & Related papers (2021-02-20T03:17:58Z) - Goal-directed Generation of Discrete Structures with Conditional
Generative Models [85.51463588099556]
We introduce a novel approach to directly optimize a reinforcement learning objective, maximizing an expected reward.
We test our methodology on two tasks: generating molecules with user-defined properties and identifying short python expressions which evaluate to a given target value.
arXiv Detail & Related papers (2020-10-05T20:03:13Z) - Physics-Constrained Predictive Molecular Latent Space Discovery with
Graph Scattering Variational Autoencoder [0.0]
We develop a molecular generative model based on variational inference and graph theory in the small data regime.
The model's performance is evaluated by generating molecules with desired target properties.
arXiv Detail & Related papers (2020-09-29T09:05:27Z) - Embedded-physics machine learning for coarse-graining and collective
variable discovery without data [3.222802562733787]
We present a novel learning framework that consistently embeds underlying physics.
We propose a novel objective based on reverse Kullback-Leibler divergence that fully incorporates the available physics in the form of the atomistic force field.
We demonstrate the algorithmic advances in terms of predictive ability and the physical meaning of the revealed CVs for a bimodal potential energy function and the alanine dipeptide.
arXiv Detail & Related papers (2020-02-24T10:28:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.