Applications of Gaussian Processes at Extreme Lengthscales: From
Molecules to Black Holes
- URL: http://arxiv.org/abs/2303.14291v1
- Date: Fri, 24 Mar 2023 22:20:14 GMT
- Title: Applications of Gaussian Processes at Extreme Lengthscales: From
Molecules to Black Holes
- Authors: Ryan-Rhys Griffiths
- Abstract summary: This thesis aims to use GP modelling to reason about the latent emission signature from the Seyfert galaxy Markarian 335.
The second contribution is to extend the GP framework to molecular and chemical reaction representations and to provide an open-source software library to enable the framework to be used by scientists.
The fourth contribution is to introduce a Bayesian optimisation scheme capable of modelling aleatoric uncertainty to facilitate the identification of material compositions that possess intrinsic robustness to large scale fabrication processes.
- Score: 4.18804572788063
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In many areas of the observational and experimental sciences data is scarce.
Data observation in high-energy astrophysics is disrupted by celestial
occlusions and limited telescope time while data derived from laboratory
experiments in synthetic chemistry and materials science is time and
cost-intensive to collect. On the other hand, knowledge about the
data-generation mechanism is often available in the sciences, such as the
measurement error of a piece of laboratory apparatus. Both characteristics,
small data and knowledge of the underlying physics, make Gaussian processes
(GPs) ideal candidates for fitting such datasets. GPs can make predictions with
consideration of uncertainty, for example in the virtual screening of molecules
and materials, and can also make inferences about incomplete data such as the
latent emission signature from a black hole accretion disc. Furthermore, GPs
are currently the workhorse model for Bayesian optimisation, a methodology
foreseen to be a guide for laboratory experiments in scientific discovery
campaigns. The first contribution of this thesis is to use GP modelling to
reason about the latent emission signature from the Seyfert galaxy Markarian
335, and by extension, to reason about the applicability of various theoretical
models of black hole accretion discs. The second contribution is to extend the
GP framework to molecular and chemical reaction representations and to provide
an open-source software library to enable the framework to be used by
scientists. The third contribution is to leverage GPs to discover novel and
performant photoswitch molecules. The fourth contribution is to introduce a
Bayesian optimisation scheme capable of modelling aleatoric uncertainty to
facilitate the identification of material compositions that possess intrinsic
robustness to large scale fabrication processes.
Related papers
- Improving Molecular Modeling with Geometric GNNs: an Empirical Study [56.52346265722167]
This paper focuses on the impact of different canonicalization methods, (2) graph creation strategies, and (3) auxiliary tasks, on performance, scalability and symmetry enforcement.
Our findings and insights aim to guide researchers in selecting optimal modeling components for molecular modeling tasks.
arXiv Detail & Related papers (2024-07-11T09:04:12Z) - Additive Multi-Index Gaussian process modeling, with application to
multi-physics surrogate modeling of the quark-gluon plasma [1.7368964547487395]
The Quark-Gluon Plasma (QGP) is a unique phase of nuclear matter, theorized to have filled the Universe shortly after the Big Bang.
To reconcile experimental observables with theoretical parameters, one requires many simulation runs of a complex physics model over a high-dimensional parameter space.
We propose a new Additive Multi-Index Gaussian process (AdMIn-GP) model, which leverages a flexible additive structure on low-dimensional embeddings of the parameter space.
arXiv Detail & Related papers (2023-06-11T02:42:09Z) - GFlowNets for AI-Driven Scientific Discovery [74.27219800878304]
We present a new probabilistic machine learning framework called GFlowNets.
GFlowNets can be applied in the modeling, hypotheses generation and experimental design stages of the experimental science loop.
We argue that GFlowNets can become a valuable tool for AI-driven scientific discovery.
arXiv Detail & Related papers (2023-02-01T17:29:43Z) - Generative structured normalizing flow Gaussian processes applied to
spectroscopic data [4.0773490083614075]
In the physical sciences, limited training data may not adequately characterize future observed data.
It is critical that models adequately indicate uncertainty, particularly when they may be asked to extrapolate.
We demonstrate the methodology on laser-induced breakdown spectroscopy data from the ChemCam instrument onboard the Mars rover Curiosity.
arXiv Detail & Related papers (2022-12-14T23:57:46Z) - Machine learning-accelerated chemistry modeling of protoplanetary disks [0.0]
We used a thermo-chemical modeling code to generate a diverse population of protoplanetary disk models.
We trained a K-nearest neighbors regressor to instantly predict the chemistry of other disk models.
arXiv Detail & Related papers (2022-09-27T12:42:13Z) - Learning inducing points and uncertainty on molecular data by scalable
variational Gaussian processes [0.0]
We show that variational learning of the inducing points in a molecular descriptor space improves the prediction of energies and atomic forces on two molecular dynamics datasets.
We extend our study to a large molecular crystal system, showing that variational GP models perform well for predicting atomic forces by efficiently learning a sparse representation of the dataset.
arXiv Detail & Related papers (2022-07-16T10:41:41Z) - Deep Learning Models of the Discrete Component of the Galactic
Interstellar Gamma-Ray Emission [61.26321023273399]
A significant point-like component from the small scale (or discrete) structure in the H2 interstellar gas might be present in the Fermi-LAT data.
We show that deep learning may be effectively employed to model the gamma-ray emission traced by these rare H2 proxies within statistical significance in data-rich regions.
arXiv Detail & Related papers (2022-06-06T18:00:07Z) - Prediction of liquid fuel properties using machine learning models with
Gaussian processes and probabilistic conditional generative learning [56.67751936864119]
The present work aims to construct cheap-to-compute machine learning (ML) models to act as closure equations for predicting the physical properties of alternative fuels.
Those models can be trained using the database from MD simulations and/or experimental measurements in a data-fusion-fidelity approach.
The results show that ML models can predict accurately the fuel properties of a wide range of pressure and temperature conditions.
arXiv Detail & Related papers (2021-10-18T14:43:50Z) - Visualizing spinon Fermi surfaces with time-dependent spectroscopy [62.997667081978825]
We propose applying time-dependent photo-emission spectroscopy, an established tool in solid state systems, in cold atom quantum simulators.
We show in exact diagonalization simulations of the one-dimensional $t-J$ model that the spinons start to populate previously unoccupied states in an effective band structure.
The dependence of the spectral function on the time after the pump pulse reveals collective interactions among spinons.
arXiv Detail & Related papers (2021-05-27T18:00:02Z) - Machine Learning in Nano-Scale Biomedical Engineering [77.75587007080894]
We review the existing research regarding the use of machine learning in nano-scale biomedical engineering.
The main challenges that can be formulated as ML problems are classified into the three main categories.
For each of the presented methodologies, special emphasis is given to its principles, applications, and limitations.
arXiv Detail & Related papers (2020-08-05T15:45:54Z) - Embedded-physics machine learning for coarse-graining and collective
variable discovery without data [3.222802562733787]
We present a novel learning framework that consistently embeds underlying physics.
We propose a novel objective based on reverse Kullback-Leibler divergence that fully incorporates the available physics in the form of the atomistic force field.
We demonstrate the algorithmic advances in terms of predictive ability and the physical meaning of the revealed CVs for a bimodal potential energy function and the alanine dipeptide.
arXiv Detail & Related papers (2020-02-24T10:28:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.