Graph Neural Networks for Surfactant Multi-Property Prediction
- URL: http://arxiv.org/abs/2401.01874v1
- Date: Wed, 3 Jan 2024 18:32:25 GMT
- Title: Graph Neural Networks for Surfactant Multi-Property Prediction
- Authors: Christoforos Brozos, Jan G. Rittig, Sandip Bhattacharya, Elie Akanny,
Christina Kohlmann, Alexander Mitsos
- Abstract summary: Graph Neural Networks (GNNs) have exhibited a great predictive performance for property prediction of ionic liquids, polymers and drugs in general.
We create the largest available CMC database with 429 molecules and the first large data collection for surface excess concentration.
GNN yields highly accurate predictions for CMC, showing great potential for future industrial applications.
- Score: 38.39977540117143
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Surfactants are of high importance in different industrial sectors such as
cosmetics, detergents, oil recovery and drug delivery systems. Therefore, many
quantitative structure-property relationship (QSPR) models have been developed
for surfactants. Each predictive model typically focuses on one surfactant
class, mostly nonionics. Graph Neural Networks (GNNs) have exhibited a great
predictive performance for property prediction of ionic liquids, polymers and
drugs in general. Specifically for surfactants, GNNs can successfully predict
critical micelle concentration (CMC), a key surfactant property associated with
micellization. A key factor in the predictive ability of QSPR and GNN models is
the data available for training. Based on extensive literature search, we
create the largest available CMC database with 429 molecules and the first
large data collection for surface excess concentration ($\Gamma$$_{m}$),
another surfactant property associated with foaming, with 164 molecules. Then,
we develop GNN models to predict the CMC and $\Gamma$$_{m}$ and we explore
different learning approaches, i.e., single- and multi-task learning, as well
as different training strategies, namely ensemble and transfer learning. We
find that a multi-task GNN with ensemble learning trained on all $\Gamma$$_{m}$
and CMC data performs best. Finally, we test the ability of our CMC model to
generalize on industrial grade pure component surfactants. The GNN yields
highly accurate predictions for CMC, showing great potential for future
industrial applications.
Related papers
- Predicting the Temperature-Dependent CMC of Surfactant Mixtures with Graph Neural Networks [36.814181034608666]
Surfactants are key ingredients in foaming and cleansing products across various industries.
In practice, surfactant mixtures are typically used due to to performance, environmental, and cost reasons.
We develop a graph neural network framework for surfactant mixtures to predict the temperature-dependent CMC.
arXiv Detail & Related papers (2024-11-04T16:17:57Z) - Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure [53.76752789814785]
DumplingGNN is a hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure.
We evaluate it on a comprehensive ADC payload dataset focusing on DNA Topoisomerase I inhibitors.
It demonstrates exceptional accuracy (91.48%), sensitivity (95.08%), and specificity (97.54%) on our specialized ADC payload dataset.
arXiv Detail & Related papers (2024-09-23T17:11:04Z) - Predicting the Temperature Dependence of Surfactant CMCs Using Graph
Neural Networks [38.39977540117143]
classical QSPR and Graph Neural Networks (GNNs) have been successfully applied to predict the CMC of surfactants at room temperature.
We herein develop a GNN model for temperature-dependent CMC prediction of surfactants.
arXiv Detail & Related papers (2024-03-06T15:03:04Z) - Activity Cliff Prediction: Dataset and Benchmark [20.41770222873952]
We first introduce ACNet, a large-scale dataset for AC prediction.
ACNet curates over 400K Matched Molecular Pairs (MMPs) against 190 targets.
We propose a baseline framework to benchmark the predictive performance of molecular representations encoded by deep neural networks for AC prediction.
arXiv Detail & Related papers (2023-02-15T09:19:07Z) - Graph neural networks for the prediction of molecular structure-property
relationships [59.11160990637615]
Graph neural networks (GNNs) are a novel machine learning method that directly work on the molecular graph.
GNNs allow to learn properties in an end-to-end fashion, thereby avoiding the need for informative descriptors.
We describe the fundamentals of GNNs and demonstrate the application of GNNs via two examples for molecular property prediction.
arXiv Detail & Related papers (2022-07-25T11:30:44Z) - Graph Neural Networks for Temperature-Dependent Activity Coefficient
Prediction of Solutes in Ionic Liquids [58.720142291102135]
We present a GNN to predict temperature-dependent infinite dilution ACs of solutes in ILs.
We train the GNN on a database including more than 40,000 AC values and compare it to a state-of-the-art MCM.
The GNN and MCM achieve similar high prediction performance, with the GNN additionally enabling high-quality predictions for ACs of solutions that contain ILs and solutes not considered during training.
arXiv Detail & Related papers (2022-06-23T15:27:29Z) - Orbital Graph Convolutional Neural Network for Material Property
Prediction [0.0]
We propose the Orbital Graph Convolutional Neural Network (OGCNN), a crystal graph convolutional neural network framework.
OGCNN includes atomic orbital interaction features that learn material properties in a robust way.
We examined the performance of this model on a broad range of crystalline material data to predict different properties.
arXiv Detail & Related papers (2020-08-14T15:22:22Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z) - Assessing Graph-based Deep Learning Models for Predicting Flash Point [52.931492216239995]
Graph-based deep learning (GBDL) models were implemented in predicting flash point for the first time.
Average R2 and Mean Absolute Error (MAE) scores of MPNN are, respectively, 2.3% lower and 2.0 K higher than previous comparable studies.
arXiv Detail & Related papers (2020-02-26T06:10:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.