Graphical Gaussian Process Regression Model for Aqueous Solvation Free
Energy Prediction of Organic Molecules in Redox Flow Battery
- URL: http://arxiv.org/abs/2106.08146v1
- Date: Tue, 15 Jun 2021 13:48:26 GMT
- Title: Graphical Gaussian Process Regression Model for Aqueous Solvation Free
Energy Prediction of Organic Molecules in Redox Flow Battery
- Authors: Peiyuan Gao, Xiu Yang, Yu-Hang Tang, Muqing Zheng, Amity Anderson,
Vijayakumar Murugesan, Aaron Hollas, Wei Wang
- Abstract summary: We present a machine learning (ML) model that can learn and predict the aqueous solvation free energy of an organic molecule.
We demonstrate that our ML model can predict the solvation free energy of molecules at chemical accuracy with a mean absolute error of less than 1 kcal/mol.
- Score: 2.7919873713279033
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The solvation free energy of organic molecules is a critical parameter in
determining emergent properties such as solubility, liquid-phase equilibrium
constants, and pKa and redox potentials in an organic redox flow battery. In
this work, we present a machine learning (ML) model that can learn and predict
the aqueous solvation free energy of an organic molecule using Gaussian process
regression method based on a new molecular graph kernel. To investigate the
performance of the ML model on electrostatic interaction, the nonpolar
interaction contribution of solvent and the conformational entropy of solute in
solvation free energy, three data sets with implicit or explicit water solvent
models, and contribution of conformational entropy of solute are tested. We
demonstrate that our ML model can predict the solvation free energy of
molecules at chemical accuracy with a mean absolute error of less than 1
kcal/mol for subsets of the QM9 dataset and the Freesolv database. To solve the
general data scarcity problem for a graph-based ML model, we propose a
dimension reduction algorithm based on the distance between molecular graphs,
which can be used to examine the diversity of the molecular data set. It
provides a promising way to build a minimum training set to improve prediction
for certain test sets where the space of molecular structures is predetermined.
Related papers
- Predicting solvation free energies with an implicit solvent machine learning potential [0.0]
We introduce a Solvation Free Energy Path Reweighting (ReSolv) framework to parametrize an implicit solvent ML potential for small organic molecules.
With a combination of top-down (experimental hydration free energy data) and bottom-up (ab initio data of molecules in a vacuum) learning, ReSolv bypasses the need for intractable ab initio data of molecules in explicit bulk solvent.
On the FreeSolv dataset, ReSolv achieves a mean absolute error close to average experimental uncertainty, significantly outperforming standard explicit solvent force fields.
arXiv Detail & Related papers (2024-05-31T20:28:08Z) - Graph Diffusion Transformer for Multi-Conditional Molecular Generation [16.58392955245203]
We present the Graph Diffusion Transformer (Graph DiT) for multi-conditional molecular generation.
Graph DiT has a condition encoder to learn the representation of numerical and categorical properties.
We extensively validate the Graph DiT for multi-conditional polymer and small molecule generation.
arXiv Detail & Related papers (2024-01-24T23:45:31Z) - Predicting Drug Solubility Using Different Machine Learning Methods --
Linear Regression Model with Extracted Chemical Features vs Graph
Convolutional Neural Network [1.8936798735951967]
We employ two machine learning models: a linear regression model and a graph convolutional neural network (GCNN) model, using various experimental datasets.
The present GCNN model has limited interpretability while the linear regression model allows scientists for a greater in-depth analysis of the underlying factors.
From the perspective of chemistry, using the linear regression model, we elucidated the impact of individual atom species and functional groups on overall solubility.
arXiv Detail & Related papers (2023-08-23T15:35:20Z) - Molecule Design by Latent Space Energy-Based Modeling and Gradual
Distribution Shifting [53.44684898432997]
Generation of molecules with desired chemical and biological properties is critical for drug discovery.
We propose a probabilistic generative model to capture the joint distribution of molecules and their properties.
Our method achieves very strong performances on various molecule design tasks.
arXiv Detail & Related papers (2023-06-09T03:04:21Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Bi-level Contrastive Learning for Knowledge-Enhanced Molecule
Representations [55.42602325017405]
We propose a novel method called GODE, which takes into account the two-level structure of individual molecules.
By pre-training two graph neural networks (GNNs) on different graph structures, combined with contrastive learning, GODE fuses molecular structures with their corresponding knowledge graph substructures.
When fine-tuned across 11 chemical property tasks, our model outperforms existing benchmarks, registering an average ROC-AUC uplift of 13.8% for classification tasks and an average RMSE/MAE enhancement of 35.1% for regression tasks.
arXiv Detail & Related papers (2023-06-02T15:49:45Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - MUDiff: Unified Diffusion for Complete Molecule Generation [104.7021929437504]
We present a new model for generating a comprehensive representation of molecules, including atom features, 2D discrete molecule structures, and 3D continuous molecule coordinates.
We propose a novel graph transformer architecture to denoise the diffusion process.
Our model is a promising approach for designing stable and diverse molecules and can be applied to a wide range of tasks in molecular modeling.
arXiv Detail & Related papers (2023-04-28T04:25:57Z) - Machine Learning Implicit Solvation for Molecular Dynamics [0.0]
We introduce Bornet, a graph neural network, to model the implicit solvent potential of mean force.
The success of this novel method demonstrates the potential benefit of applying machine learning methods in accurate modeling of solvent effects.
arXiv Detail & Related papers (2021-06-14T15:21:45Z) - BIGDML: Towards Exact Machine Learning Force Fields for Materials [55.944221055171276]
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof.
Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 atoms.
arXiv Detail & Related papers (2021-06-08T10:14:57Z) - Accurate Prediction of Free Solvation Energy of Organic Molecules via
Graph Attention Network and Message Passing Neural Network from Pairwise
Atomistic Interactions [14.87390785780636]
We propose two novel models for the problem of free solvation energy predictions, based on the Graph Neural Network (GNN) architectures.
GNNs are capable of summarizing the predictive information of a molecule as low-dimensional features directly from its graph structure.
We show that our proposed models outperform all quantum mechanical and molecular dynamics methods in addition to existing alternative machine learning based approaches in the task of solvation free energy prediction.
arXiv Detail & Related papers (2021-04-15T22:15:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.