A graph representation of molecular ensembles for polymer property
prediction
- URL: http://arxiv.org/abs/2205.08619v1
- Date: Tue, 17 May 2022 20:31:43 GMT
- Title: A graph representation of molecular ensembles for polymer property
prediction
- Authors: Matteo Aldeghi and Connor W. Coley
- Abstract summary: In contrast to organic molecules, polymers are often not well-defined single structures but an ensemble of similar molecules.
We introduce a graph representation of molecular ensembles and an associated graph neural network architecture that is tailored to polymer property prediction.
- Score: 3.032184156362992
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Synthetic polymers are versatile and widely used materials. Similar to small
organic molecules, a large chemical space of such materials is hypothetically
accessible. Computational property prediction and virtual screening can
accelerate polymer design by prioritizing candidates expected to have favorable
properties. However, in contrast to organic molecules, polymers are often not
well-defined single structures but an ensemble of similar molecules, which
poses unique challenges to traditional chemical representations and machine
learning approaches. Here, we introduce a graph representation of molecular
ensembles and an associated graph neural network architecture that is tailored
to polymer property prediction. We demonstrate that this approach captures
critical features of polymeric materials, like chain architecture, monomer
stoichiometry, and degree of polymerization, and achieves superior accuracy to
off-the-shelf cheminformatics methodologies. While doing so, we built a dataset
of simulated electron affinity and ionization potential values for >40k
polymers with varying monomer composition, stoichiometry, and chain
architecture, which may be used in the development of other tailored machine
learning approaches. The dataset and machine learning models presented in this
work pave the path toward new classes of algorithms for polymer informatics
and, more broadly, introduce a framework for the modeling of molecular
ensembles.
Related papers
- GraphXForm: Graph transformer for computer-aided molecular design with application to extraction [73.1842164721868]
We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned.
We evaluate it on two solvent design tasks for liquid-liquid extraction, showing that it outperforms four state-of-the-art molecular design techniques.
arXiv Detail & Related papers (2024-11-03T19:45:15Z) - Molecular topological deep learning for polymer property prediction [18.602659324026934]
We develop molecular topological deep learning (Mol-TDL) for polymer property analysis.
Mol-TDL incorporates both high-order interactions and multiscale properties into topological deep learning architecture.
arXiv Detail & Related papers (2024-10-07T05:44:02Z) - Inverse Design of Copolymers Including Stoichiometry and Chain Architecture [0.0]
Machine learning-guided molecular design is a promising approach to accelerate polymer discovery.
We develop a novel variational autoencoder architecture encoding a graph and decoding a string.
Our model learns a continuous, well organized latent space that enables de-novo generation of copolymer structures.
arXiv Detail & Related papers (2024-09-30T15:37:39Z) - Towards Predicting Equilibrium Distributions for Molecular Systems with
Deep Learning [60.02391969049972]
We introduce a novel deep learning framework, called Distributional Graphormer (DiG), in an attempt to predict the equilibrium distribution of molecular systems.
DiG employs deep neural networks to transform a simple distribution towards the equilibrium distribution, conditioned on a descriptor of a molecular system.
arXiv Detail & Related papers (2023-06-08T17:12:08Z) - Atomic and Subgraph-aware Bilateral Aggregation for Molecular
Representation Learning [57.670845619155195]
We introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA)
ASBA addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information.
Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications.
arXiv Detail & Related papers (2023-05-22T00:56:00Z) - An Equivariant Generative Framework for Molecular Graph-Structure
Co-Design [54.92529253182004]
We present MolCode, a machine learning-based generative framework for underlineMolecular graph-structure underlineCo-design.
In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure.
Our investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design.
arXiv Detail & Related papers (2023-04-12T13:34:22Z) - Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular
Property Prediction [53.06671763877109]
We develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction.
Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations.
On multiple molecular property prediction benchmarks, training from the embedding space substantially improves Multi-Task, MAML, and Prototypical Network few-shot learning performance.
arXiv Detail & Related papers (2023-02-04T01:32:40Z) - A Molecular Multimodal Foundation Model Associating Molecule Graphs with
Natural Language [63.60376252491507]
We propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data.
We believe that our model would have a broad impact on AI-empowered fields across disciplines such as biology, chemistry, materials, environment, and medicine.
arXiv Detail & Related papers (2022-09-12T00:56:57Z) - Representing Polymers as Periodic Graphs with Learned Descriptors for
Accurate Polymer Property Predictions [16.468017785818198]
We develop a periodic polymer graph representation that consistently outperforms hand-designed representations.
We also demonstrate how combining polymer graph representations with message-passing neural network architectures can automatically extract meaningful polymer features.
arXiv Detail & Related papers (2022-05-27T04:14:12Z) - Copolymer Informatics with Multi-Task Deep Neural Networks [0.0]
We address the property prediction challenge for copolymers, extending the polymer informatics framework beyond homopolymers.
A large data set containing over 18,000 data points of glass transition, melting, and degradation temperature of homopolymers and copolymers of up to two monomers is used.
The developed models are accurate, fast, flexible, and scalable to more copolymer properties when suitable data become available.
arXiv Detail & Related papers (2021-03-25T23:28:20Z) - Representations of molecules and materials for interpolation of
quantum-mechanical simulations via machine learning [0.0]
Computational study of molecules and materials from first principles is a cornerstone of physics, chemistry, and materials science.
In settings involving many simulations, machine learning can reduce these costs, often by orders of magnitude, by interpolating between reference simulations.
We comprehensively review and discuss current representations and relations between them, using a unified mathematical framework.
arXiv Detail & Related papers (2020-03-26T18:01:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.