Differentiable Scaffolding Tree for Molecular Optimization
- URL: http://arxiv.org/abs/2109.10469v1
- Date: Wed, 22 Sep 2021 01:16:22 GMT
- Title: Differentiable Scaffolding Tree for Molecular Optimization
- Authors: Tianfan Fu, Wenhao Gao, Cao Xiao, Jacob Yasonik, Connor W. Coley,
Jimeng Sun
- Abstract summary: We propose differentiable scaffolding tree (DST) that utilizes a learned knowledge network to convert discrete chemical structures to locally differentiable ones.
Our empirical studies show the gradient-based molecular optimizations are both effective and sample efficient.
- Score: 47.447362691543304
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The structural design of functional molecules, also called molecular
optimization, is an essential chemical science and engineering task with
important applications, such as drug discovery. Deep generative models and
combinatorial optimization methods achieve initial success but still struggle
with directly modeling discrete chemical structures and often heavily rely on
brute-force enumeration. The challenge comes from the discrete and
non-differentiable nature of molecule structures. To address this, we propose
differentiable scaffolding tree (DST) that utilizes a learned knowledge network
to convert discrete chemical structures to locally differentiable ones. DST
enables a gradient-based optimization on a chemical graph structure by
back-propagating the derivatives from the target properties through a graph
neural network (GNN). Our empirical studies show the gradient-based molecular
optimizations are both effective and sample efficient. Furthermore, the learned
graph parameters can also provide an explanation that helps domain experts
understand the model output.
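The gradient-based idea in the abstract can be illustrated with a minimal sketch. This is not the authors' DST implementation: the three-node chain scaffold, the `TYPE_WEIGHTS` values standing in for a trained predictor, and all function names are assumptions made for illustration only. Each node's discrete substructure choice is relaxed to a softmax distribution, a toy one-round message-passing predictor scores the relaxed graph, and the gradient of that score is followed back to the per-node logits before discretizing again.

```python
import math

# Hypothetical toy scaffold: 3 nodes in a chain, 3 candidate substructure types.
EDGES = [(0, 1), (1, 2)]
N_NODES = 3
# Stand-in for a trained, frozen property predictor (assumed values).
TYPE_WEIGHTS = [0.2, 1.0, -0.5]

NEIGHBORS = [[] for _ in range(N_NODES)]
for a, b in EDGES:
    NEIGHBORS[a].append(b)
    NEIGHBORS[b].append(a)


def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(probs):
    """Mean-aggregation message passing followed by a linear readout."""
    score = 0.0
    for i in range(N_NODES):
        for k, w in enumerate(TYPE_WEIGHTS):
            msg = sum(probs[j][k] for j in NEIGHBORS[i]) / len(NEIGHBORS[i])
            score += w * (probs[i][k] + msg)
    return score


def grad_logits(logits):
    """Analytic gradient of predict(softmax(logits)) w.r.t. the logits."""
    grads = []
    for i in range(N_NODES):
        p = softmax(logits[i])
        # Node i contributes directly and via each neighbor's averaged message.
        coeff = 1.0 + sum(1.0 / len(NEIGHBORS[j]) for j in NEIGHBORS[i])
        g = [coeff * w for w in TYPE_WEIGHTS]  # d score / d p[i][k]
        mean_g = sum(pk * gk for pk, gk in zip(p, g))
        # Chain rule through the softmax.
        grads.append([pk * (gk - mean_g) for pk, gk in zip(p, g)])
    return grads


def optimize(steps=200, lr=0.5):
    """Gradient ascent on the relaxed node types, then discretize by argmax."""
    logits = [[0.0] * len(TYPE_WEIGHTS) for _ in range(N_NODES)]
    for _ in range(steps):
        grads = grad_logits(logits)
        for i in range(N_NODES):
            for k in range(len(TYPE_WEIGHTS)):
                logits[i][k] += lr * grads[i][k]
    probs = [softmax(z) for z in logits]
    types = [max(range(len(p)), key=p.__getitem__) for p in probs]
    return types, probs
```

Calling `optimize()` returns the discretized type index per node together with the relaxed distributions. In a realistic setting the hand-derived gradient would be replaced by automatic differentiation through a trained GNN, and the final distributions are the kind of learned graph parameters the abstract suggests can be inspected for explanation.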
Related papers
- Investigating Graph Neural Networks and Classical Feature-Extraction Techniques in Activity-Cliff and Molecular Property Prediction [0.6906005491572401]
Molecular featurisation refers to the transformation of molecular data into numerical feature vectors.
Message-passing graph neural networks (GNNs) have emerged as a novel method to learn differentiable features directly from molecular graphs.
(2024-11-20T20:07:48Z)
- GraphXForm: Graph transformer for computer-aided molecular design with application to extraction [73.1842164721868]
We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned.
We evaluate it on two solvent design tasks for liquid-liquid extraction, showing that it outperforms four state-of-the-art molecular design techniques.
(2024-11-03T19:45:15Z)
- Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [77.50732023411811]
We propose a text-guided multi-property molecular optimization method utilizing a transformer-based diffusion language model (TransDLM).
TransDLM leverages standardized chemical nomenclature as semantic representations of molecules and implicitly embeds property requirements into textual descriptions.
Our approach surpasses state-of-the-art methods in optimizing molecular structural similarity and enhancing chemical properties on the benchmark dataset.
(2024-10-17T14:30:27Z)
- Molecule Design by Latent Prompt Transformer [76.2112075557233]
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
(2024-02-27T03:33:23Z)
- MolCPT: Molecule Continuous Prompt Tuning to Generalize Molecular Representation Learning [77.31492888819935]
We propose a novel paradigm of "pre-train, prompt, fine-tune" for molecular representation learning, named molecule continuous prompt tuning (MolCPT).
MolCPT defines a motif prompting function that uses the pre-trained model to project the standalone input into an expressive prompt.
Experiments on several benchmark datasets show that MolCPT efficiently generalizes pre-trained GNNs for molecular property prediction.
(2022-12-20T19:32:30Z)
- Semi-Supervised GCN for learning Molecular Structure-Activity Relationships [4.468952886990851]
We propose to train a graph-to-graph neural network using semi-supervised learning for attributing structure-property relationships.
As a final goal, our approach could represent a valuable tool for dealing with problems such as activity cliffs, lead optimization, and de-novo drug design.
(2022-01-25T09:09:43Z)
- Molecular Graph Generation via Geometric Scattering [7.796917261490019]
Graph neural networks (GNNs) have been used extensively for addressing problems in drug design and discovery.
We propose a representation-first approach to molecular graph generation.
We show that our architecture learns meaningful representations of drug datasets and provides a platform for goal-directed drug synthesis.
(2021-10-12T18:00:23Z)
- Realistic molecule optimization on a learned graph manifold [4.640835690336652]
We show that learned realism sampling produces empirically more realistic molecules and outperforms all recent baselines in the task of molecule optimization with similarity constraints.
In this work we use a hybrid approach, where the dataset distribution is learned using an autoregressive model while the score optimization is done using the Metropolis algorithm.
(2021-06-03T07:39:35Z)
- Deep Molecular Dreaming: Inverse machine learning for de-novo molecular design and interpretability with surjective representations [1.433758865948252]
We propose PASITHEA, a gradient-based molecule optimization technique from computer vision.
It exploits the use of gradients by directly reversing the learning process of a neural network, which is trained to predict real-valued chemical properties.
Although our results are preliminary, we observe a shift in distribution of a chosen property during inverse-training, a clear indication of PASITHEA's viability.
(2020-12-17T16:34:59Z)
- Reinforced Molecular Optimization with Neighborhood-Controlled Grammars [63.84003497770347]
We propose MNCE-RL, a graph convolutional policy network for molecular optimization.
We extend the original neighborhood-controlled embedding grammars to make them applicable to molecular graph generation.
We show that our approach achieves state-of-the-art performance in a diverse range of molecular optimization tasks.
(2020-11-14T05:42:15Z)