Guiding Deep Molecular Optimization with Genetic Exploration
- URL: http://arxiv.org/abs/2007.04897v3
- Date: Tue, 27 Oct 2020 10:49:47 GMT
- Title: Guiding Deep Molecular Optimization with Genetic Exploration
- Authors: Sungsoo Ahn, Junsu Kim, Hankook Lee, Jinwoo Shin
- Abstract summary: We propose genetic expert-guided learning (GEGL), a framework for training a deep neural network (DNN) to generate highly-rewarding molecules.
Extensive experiments show that GEGL significantly improves over state-of-the-art methods.
- Score: 79.50698140997726
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: De novo molecular design attempts to search over the chemical space for
molecules with the desired property. Recently, deep learning has gained
considerable attention as a promising approach to solve the problem. In this
paper, we propose genetic expert-guided learning (GEGL), a simple yet novel
framework for training a deep neural network (DNN) to generate highly-rewarding
molecules. Our main idea is to design a "genetic expert improvement" procedure,
which generates high-quality targets for imitation learning of the DNN.
Extensive experiments show that GEGL significantly improves over
state-of-the-art methods. For example, GEGL manages to solve the penalized
octanol-water partition coefficient optimization with a score of 31.40, while
the best-known score in the literature is 27.22. Besides, for the GuacaMol
benchmark with 20 tasks, our method achieves the highest score for 19 tasks, in
comparison with state-of-the-art methods, and newly obtains the perfect score
for three tasks.
Related papers
- TAGMol: Target-Aware Gradient-guided Molecule Generation [19.977071499171903]
3D generative models have shown significant promise in structure-based drug design (SBDD)
We decouple the problem into molecular generation and property prediction.
The latter synergistically guides the diffusion sampling process, facilitating guided diffusion and resulting in the creation of meaningful molecules with the desired properties.
We call this guided molecular generation process as TAGMol.
arXiv Detail & Related papers (2024-06-03T14:43:54Z) - Data-Efficient Molecular Generation with Hierarchical Textual Inversion [48.816943690420224]
We introduce Hierarchical textual Inversion for Molecular generation (HI-Mol), a novel data-efficient molecular generation method.
HI-Mol is inspired by the importance of hierarchical information, e.g., both coarse- and fine-grained features, in understanding the molecule distribution.
Compared to the conventional textual inversion method in the image domain using a single-level token embedding, our multi-level token embeddings allow the model to effectively learn the underlying low-shot molecule distribution.
arXiv Detail & Related papers (2024-05-05T08:35:23Z) - Genetic-guided GFlowNets for Sample Efficient Molecular Optimization [33.270494123656746]
Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency.
This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy.
arXiv Detail & Related papers (2024-02-05T04:12:40Z) - Efficient Heterogeneous Graph Learning via Random Projection [58.4138636866903]
Heterogeneous Graph Neural Networks (HGNNs) are powerful tools for deep learning on heterogeneous graphs.
Recent pre-computation-based HGNNs use one-time message passing to transform a heterogeneous graph into regular-shaped tensors.
We propose a hybrid pre-computation-based HGNN, named Random Projection Heterogeneous Graph Neural Network (RpHGNN)
arXiv Detail & Related papers (2023-10-23T01:25:44Z) - An Empirical Evaluation of Zeroth-Order Optimization Methods on
AI-driven Molecule Optimization [78.36413169647408]
We study the effectiveness of various ZO optimization methods for optimizing molecular objectives.
We show the advantages of ZO sign-based gradient descent (ZO-signGD)
We demonstrate the potential effectiveness of ZO optimization methods on widely used benchmark tasks from the Guacamol suite.
arXiv Detail & Related papers (2022-10-27T01:58:10Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - De novo design of protein target specific scaffold-based Inhibitors via
Reinforcement Learning [8.210294479991118]
Current approaches to develop molecules for a target protein are intuition-driven, hampered by slow iterative design-test cycles.
We propose a novel framework, called 3D-MolGNN$_RL$, coupling reinforcement learning to a deep generative model based on 3D-Scaffold.
Our approach can serve as an interpretable artificial intelligence (AI) tool for lead optimization with optimized activity, potency, and biophysical properties.
arXiv Detail & Related papers (2022-05-21T00:47:35Z) - CELLS: Cost-Effective Evolution in Latent Space for Goal-Directed
Molecular Generation [23.618366377098614]
We propose a cost-effective evolution strategy in latent space, which optimize the molecular latent representation vectors.
We adopt a pre-trained molecular generative model to map the latent and observation spaces.
We conduct extensive experiments on multiple optimization tasks comparing the proposed framework to several advanced techniques.
arXiv Detail & Related papers (2021-11-30T11:02:18Z) - Improving RNA Secondary Structure Design using Deep Reinforcement
Learning [69.63971634605797]
We propose a new benchmark of applying reinforcement learning to RNA sequence design, in which the objective function is defined to be the free energy in the sequence's secondary structure.
We show results of the ablation analysis that we do for these algorithms, as well as graphs indicating the algorithm's performance across batches.
arXiv Detail & Related papers (2021-11-05T02:54:06Z) - JANUS: Parallel Tempered Genetic Algorithm Guided by Deep Neural
Networks for Inverse Molecular Design [1.6114012813668934]
Inverse molecular design, i.e., designing molecules with specific target properties, can be posed as an optimization problem.
Janus is a genetic algorithm inspired by parallel tempering that propagates two populations, one for exploration and another for exploitation.
Janus is augmented by a deep neural network that approximates molecular properties via active learning for enhanced sampling of the chemical space.
arXiv Detail & Related papers (2021-06-07T23:41:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.