Graph Polish: A Novel Graph Generation Paradigm for Molecular
Optimization
- URL: http://arxiv.org/abs/2008.06246v1
- Date: Fri, 14 Aug 2020 08:36:13 GMT
- Title: Graph Polish: A Novel Graph Generation Paradigm for Molecular
Optimization
- Authors: Chaojie Ji, Yijia Zheng, Ruxin Wang, Yunpeng Cai and Hongyan Wu
- Abstract summary: We present a novel molecular optimization paradigm, Graph Polish, which changes molecular optimization from the traditional "two-language translating" task into a "single-language" task.
We propose an effective and efficient learning framework T&S polish to capture the long-term dependencies in the optimization steps.
- Score: 7.1696593196695035
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Molecular optimization, which transforms a given input molecule X into
another Y with desirable properties, is essential in molecular drug discovery.
The traditional translating approaches, generating the molecular graphs from
scratch by adding some substructures piece by piece, prone to error because of
the large set of candidate substructures in a large number of steps to the
final target. In this study, we present a novel molecular optimization
paradigm, Graph Polish, which changes molecular optimization from the
traditional "two-language translating" task into a "single-language polishing"
task. The key to this optimization paradigm is to find an optimization center
subject to the conditions that the preserved areas around it ought to be
maximized and thereafter the removed and added regions should be minimized. We
then propose an effective and efficient learning framework T&S polish to
capture the long-term dependencies in the optimization steps. The T component
automatically identifies and annotates the optimization centers and the
preservation, removal and addition of some parts of the molecule, and the S
component learns these behaviors and applies these actions to a new molecule.
Furthermore, the proposed paradigm can offer an intuitive interpretation for
each molecular optimization result. Experiments with multiple optimization
tasks are conducted on four benchmark datasets. The proposed T&S polish
approach achieves significant advantage over the five state-of-the-art baseline
methods on all the tasks. In addition, extensive studies are conducted to
validate the effectiveness, explainability and time saving of the novel
optimization paradigm.
Related papers
- Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model [77.50732023411811]
We propose a text-guided multi-property molecular optimization method utilizing transformer-based diffusion language model (TransDLM)
TransDLM leverages standardized chemical nomenclature as semantic representations of molecules and implicitly embeds property requirements into textual descriptions.
Our approach surpasses state-of-the-art methods in optimizing molecular structural similarity and enhancing chemical properties on the benchmark dataset.
arXiv Detail & Related papers (2024-10-17T14:30:27Z) - XMOL: Explainable Multi-property Optimization of Molecules [2.320539066224081]
We propose Explainable Multi-property Optimization of Molecules (XMOL) to optimize multiple molecular properties simultaneously.
Our approach builds on state-of-the-art geometric diffusion models, extending them to multi-property optimization.
We integrate interpretive and explainable techniques throughout the optimization process.
arXiv Detail & Related papers (2024-09-12T06:35:04Z) - Small Molecule Optimization with Large Language Models [17.874902319523663]
We present two language models fine-tuned on a novel corpus of 110M molecules with computed properties, totaling 40B tokens.
We introduce a novel optimization algorithm that leverages our language models to optimize molecules for arbitrary properties given limited access to a black box oracle.
arXiv Detail & Related papers (2024-07-26T17:51:33Z) - Gradual Optimization Learning for Conformational Energy Minimization [69.36925478047682]
Gradual Optimization Learning Framework (GOLF) for energy minimization with neural networks significantly reduces the required additional data.
Our results demonstrate that the neural network trained with GOLF performs on par with the oracle on a benchmark of diverse drug-like molecules.
arXiv Detail & Related papers (2023-11-05T11:48:08Z) - An Empirical Evaluation of Zeroth-Order Optimization Methods on
AI-driven Molecule Optimization [78.36413169647408]
We study the effectiveness of various ZO optimization methods for optimizing molecular objectives.
We show the advantages of ZO sign-based gradient descent (ZO-signGD)
We demonstrate the potential effectiveness of ZO optimization methods on widely used benchmark tasks from the Guacamol suite.
arXiv Detail & Related papers (2022-10-27T01:58:10Z) - Molecular Attributes Transfer from Non-Parallel Data [57.010952598634944]
We formulate molecular optimization as a style transfer problem and present a novel generative model that could automatically learn internal differences between two groups of non-parallel data.
Experiments on two molecular optimization tasks, toxicity modification and synthesizability improvement, demonstrate that our model significantly outperforms several state-of-the-art methods.
arXiv Detail & Related papers (2021-11-30T06:10:22Z) - Reinforced Molecular Optimization with Neighborhood-Controlled Grammars [63.84003497770347]
We propose MNCE-RL, a graph convolutional policy network for molecular optimization.
We extend the original neighborhood-controlled embedding grammars to make them applicable to molecular graph generation.
We show that our approach achieves state-of-the-art performance in a diverse range of molecular optimization tasks.
arXiv Detail & Related papers (2020-11-14T05:42:15Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.