Graphical Gaussian Process Regression Model for Aqueous Solvation Free
Energy Prediction of Organic Molecules in Redox Flow Battery
- URL: http://arxiv.org/abs/2106.08146v1
- Date: Tue, 15 Jun 2021 13:48:26 GMT
- Title: Graphical Gaussian Process Regression Model for Aqueous Solvation Free
Energy Prediction of Organic Molecules in Redox Flow Battery
- Authors: Peiyuan Gao, Xiu Yang, Yu-Hang Tang, Muqing Zheng, Amity Anderson,
Vijayakumar Murugesan, Aaron Hollas, Wei Wang
- Abstract summary: We present a machine learning (ML) model that can learn and predict the aqueous solvation free energy of an organic molecule.
We demonstrate that our ML model can predict the solvation free energy of molecules at chemical accuracy with a mean absolute error of less than 1 kcal/mol.
- Score: 2.7919873713279033
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The solvation free energy of organic molecules is a critical parameter in
determining emergent properties such as solubility, liquid-phase equilibrium
constants, and pKa and redox potentials in an organic redox flow battery. In
this work, we present a machine learning (ML) model that can learn and predict
the aqueous solvation free energy of an organic molecule using Gaussian process
regression method based on a new molecular graph kernel. To investigate the
performance of the ML model on electrostatic interaction, the nonpolar
interaction contribution of solvent and the conformational entropy of solute in
solvation free energy, three data sets with implicit or explicit water solvent
models, and contribution of conformational entropy of solute are tested. We
demonstrate that our ML model can predict the solvation free energy of
molecules at chemical accuracy with a mean absolute error of less than 1
kcal/mol for subsets of the QM9 dataset and the Freesolv database. To solve the
general data scarcity problem for a graph-based ML model, we propose a
dimension reduction algorithm based on the distance between molecular graphs,
which can be used to examine the diversity of the molecular data set. It
provides a promising way to build a minimum training set to improve prediction
for certain test sets where the space of molecular structures is predetermined.
Related papers
- Conditional Synthesis of 3D Molecules with Time Correction Sampler [58.0834973489875]
Time-Aware Conditional Synthesis (TACS) is a novel approach to conditional generation on diffusion models.
It integrates adaptively controlled plug-and-play "online" guidance into a diffusion model, driving samples toward the desired properties.
arXiv Detail & Related papers (2024-11-01T12:59:25Z) - Hessian QM9: A quantum chemistry database of molecular Hessians in implicit solvents [0.31457219084519]
Hessian QM9 is the first database of equilibrium configurations and numerical Hessian matrices.
We show that incorporating second derivatives of the potential energy surface into the loss function of a MLIP significantly improves the prediction of vibrational frequencies in all solvent environments.
arXiv Detail & Related papers (2024-08-15T08:16:11Z) - Predicting solvation free energies with an implicit solvent machine learning potential [0.0]
We introduce a Solvation Free Energy Path Reweighting (ReSolv) framework to parametrize an implicit solvent ML potential for small organic molecules.
With a combination of top-down (experimental hydration free energy data) and bottom-up (ab initio data of molecules in a vacuum) learning, ReSolv bypasses the need for intractable ab initio data of molecules in explicit bulk solvent.
Compared to the explicit solvent ML potential, ReSolv offers a computational speedup of four orders of magnitude and attains closer agreement with experiments.
arXiv Detail & Related papers (2024-05-31T20:28:08Z) - Molecule Design by Latent Prompt Transformer [76.2112075557233]
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
arXiv Detail & Related papers (2024-02-27T03:33:23Z) - Predicting Drug Solubility Using Different Machine Learning Methods --
Linear Regression Model with Extracted Chemical Features vs Graph
Convolutional Neural Network [1.8936798735951967]
We employ two machine learning models: a linear regression model and a graph convolutional neural network (GCNN) model, using various experimental datasets.
The present GCNN model has limited interpretability while the linear regression model allows scientists for a greater in-depth analysis of the underlying factors.
From the perspective of chemistry, using the linear regression model, we elucidated the impact of individual atom species and functional groups on overall solubility.
arXiv Detail & Related papers (2023-08-23T15:35:20Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - Machine Learning Implicit Solvation for Molecular Dynamics [0.0]
We introduce Bornet, a graph neural network, to model the implicit solvent potential of mean force.
The success of this novel method demonstrates the potential benefit of applying machine learning methods in accurate modeling of solvent effects.
arXiv Detail & Related papers (2021-06-14T15:21:45Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - Accurate Prediction of Free Solvation Energy of Organic Molecules via
Graph Attention Network and Message Passing Neural Network from Pairwise
Atomistic Interactions [14.87390785780636]
We propose two novel models for the problem of free solvation energy predictions, based on the Graph Neural Network (GNN) architectures.
GNNs are capable of summarizing the predictive information of a molecule as low-dimensional features directly from its graph structure.
We show that our proposed models outperform all quantum mechanical and molecular dynamics methods in addition to existing alternative machine learning based approaches in the task of solvation free energy prediction.
arXiv Detail & Related papers (2021-04-15T22:15:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.