Molecular-orbital-based Machine Learning for Open-shell and
Multi-reference Systems with Kernel Addition Gaussian Process Regression
- URL: http://arxiv.org/abs/2207.08317v1
- Date: Sun, 17 Jul 2022 23:20:19 GMT
- Title: Molecular-orbital-based Machine Learning for Open-shell and
Multi-reference Systems with Kernel Addition Gaussian Process Regression
- Authors: Lixue Cheng, Jiace Sun, J. Emiliano Deustua, Vignesh C. Bhethanabotla,
Thomas F. Miller III
- Abstract summary: We introduce a novel machine learning strategy, kernel addition Gaussian process regression (KA-GPR), in molecular-orbital-based machine learning (MOB-ML)
We learn the total correlation energies of general electronic structure theories for closed- and open-shell systems by introducing a machine learning strategy.
The learning efficiency of MOB-ML (KA-GPR) is the same as the original MOB-ML method for the smallest criegee molecule, which is a closed-shell molecule with multi-reference characters.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce a novel machine learning strategy, kernel addition Gaussian
process regression (KA-GPR), in molecular-orbital-based machine learning
(MOB-ML) to learn the total correlation energies of general electronic
structure theories for closed- and open-shell systems by introducing a machine
learning strategy. The learning efficiency of MOB-ML (KA-GPR) is the same as
the original MOB-ML method for the smallest criegee molecule, which is a
closed-shell molecule with multi-reference characters. In addition, the
prediction accuracies of different small free radicals could reach the chemical
accuracy of 1 kcal/mol by training on one example structure. Accurate potential
energy surfaces for the H10 chain (closed-shell) and water OH bond dissociation
(open-shell) could also be generated by MOB-ML (KA-GPR). To explore the breadth
of chemical systems that KA-GPR can describe, we further apply MOB-ML to
accurately predict the large benchmark datasets for closed- (QM9, QM7b-T,
GDB-13-T) and open-shell (QMSpin) molecules.
Related papers
- Pre-trained Molecular Language Models with Random Functional Group Masking [54.900360309677794]
We propose a SMILES-based underlineem Molecular underlineem Language underlineem Model, which randomly masking SMILES subsequences corresponding to specific molecular atoms.
This technique aims to compel the model to better infer molecular structures and properties, thus enhancing its predictive capabilities.
arXiv Detail & Related papers (2024-11-03T01:56:15Z) - Active learning of Boltzmann samplers and potential energies with quantum mechanical accuracy [1.7633275579210346]
We develop an approach combining enhanced sampling with deep generative models and active learning of a machine learning potential.
We apply this method to study the isomerization of an ultrasmall silver nanocluster, belonging to a set of systems with diverse applications in the fields of medicine and biology.
arXiv Detail & Related papers (2024-01-29T19:01:31Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Molecular Dipole Moment Learning via Rotationally Equivariant Gaussian
Process Regression with Derivatives in Molecular-orbital-based Machine
Learning [0.0]
This study extends the accurate and transferable molecular-orbital-based machine learning (MOB-ML) approach.
A molecular-orbital-based (MOB) pairwise decomposition of the correlation part of the dipole moment is applied.
The proposed problem setup, feature design, and ML algorithm are shown to provide highly-accurate models.
arXiv Detail & Related papers (2022-05-31T02:42:50Z) - Accurate Machine Learned Quantum-Mechanical Force Fields for
Biomolecular Simulations [51.68332623405432]
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes.
Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations.
This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations.
arXiv Detail & Related papers (2022-05-17T13:08:28Z) - Accurate Molecular-Orbital-Based Machine Learning Energies via
Unsupervised Clustering of Chemical Space [0.0]
We introduce an unsupervised clustering algorithm to improve training efficiency and accuracy in predicting energies using molecular-orbital-based machine learning (MOB-ML)
This work determines clusters via the Gaussian mixture model (GMM) in an entirely automatic manner.
arXiv Detail & Related papers (2022-04-21T00:56:16Z) - Chemical-Reaction-Aware Molecule Representation Learning [88.79052749877334]
We propose using chemical reactions to assist learning molecule representation.
Our approach is proven effective to 1) keep the embedding space well-organized and 2) improve the generalization ability of molecule embeddings.
Experimental results demonstrate that our method achieves state-of-the-art performance in a variety of downstream tasks.
arXiv Detail & Related papers (2021-09-21T00:08:43Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - A Universal Framework for Featurization of Atomistic Systems [0.0]
Reactive force fields based on physics or machine learning can be used to bridge the gap in time and length scales.
We introduce the Gaussian multi-pole (GMP) featurization scheme that utilizes physically-relevant multi-pole expansions of the electron density around atoms.
We demonstrate that GMP-based models can achieve chemical accuracy for the QM9 dataset, and their accuracy remains reasonable even when extrapolating to new elements.
arXiv Detail & Related papers (2021-02-04T03:11:00Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.