Scaffold-constrained molecular generation
- URL: http://arxiv.org/abs/2009.07778v3
- Date: Mon, 5 Oct 2020 10:50:26 GMT
- Title: Scaffold-constrained molecular generation
- Authors: Maxime Langevin, Herve Minoux, Maximilien Levesque, Marc Bianciotto
- Abstract summary: We build on the well-known SMILES-based Recurrent Neural Network (RNN) generative model, with a modified sampling procedure to achieve scaffold-constrained generation.
We showcase the method's ability to perform scaffold-constrained generation on various tasks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the major applications of generative models for drug Discovery targets
the lead-optimization phase. During the optimization of a lead series, it is
common to have scaffold constraints imposed on the structure of the molecules
designed. Without enforcing such constraints, the probability of generating
molecules with the required scaffold is extremely low and hinders the
practicality of generative models for de-novo drug design. To tackle this
issue, we introduce a new algorithm to perform scaffold-constrained in-silico
molecular design. We build on the well-known SMILES-based Recurrent Neural
Network (RNN) generative model, with a modified sampling procedure to achieve
scaffold-constrained generation. We directly benefit from the associated
reinforcement Learning methods, allowing to design molecules optimized for
different properties while exploring only the relevant chemical space. We
showcase the method's ability to perform scaffold-constrained generation on
various tasks: designing novel molecules around scaffolds extracted from
SureChEMBL chemical series, generating novel active molecules on the Dopamine
Receptor D2 (DRD2) target, and, finally, designing predicted actives on the
MMP-12 series, an industrial lead-optimization project.
Related papers
- Exploring Discrete Flow Matching for 3D De Novo Molecule Generation [0.0]
Flow matching is a recently proposed generative modeling framework that has achieved impressive performance on a variety of tasks.
We present FlowMol-CTMC, an open-source model that achieves state of the art performance for 3D de novo design with fewer learnable parameters than existing methods.
arXiv Detail & Related papers (2024-11-25T18:27:39Z) - DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization [49.85944390503957]
DecompOpt is a structure-based molecular optimization method based on a controllable and diffusion model.
We show that DecompOpt can efficiently generate molecules with improved properties than strong de novo baselines.
arXiv Detail & Related papers (2024-03-07T02:53:40Z) - Molecule Design by Latent Prompt Transformer [76.2112075557233]
This work explores the challenging problem of molecule design by framing it as a conditional generative modeling task.
We propose a novel generative model comprising three components: (1) a latent vector with a learnable prior distribution; (2) a molecule generation model based on a causal Transformer, which uses the latent vector as a prompt; and (3) a property prediction model that predicts a molecule's target properties and/or constraint values using the latent prompt.
arXiv Detail & Related papers (2024-02-27T03:33:23Z) - DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design [62.68420322996345]
Existing structured-based drug design methods treat all ligand atoms equally.
We propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold.
Our approach achieves state-of-the-art performance in generating high-affinity molecules.
arXiv Detail & Related papers (2024-02-26T05:21:21Z) - Optimizing Drug Design by Merging Generative AI With Active Learning
Frameworks [2.6062146828550903]
We have developed a Generative AI (GM) workflow based on a variational autoencoder and active learning steps.
The designed GM workflow iteratively learns from molecular metrics, including drug likeliness, synthesizability, similarity, and docking scores.
The proportion of high-affinity molecules inferred by our GM workflow was significantly greater than that in the training data.
arXiv Detail & Related papers (2023-05-04T13:25:14Z) - Retrieval-based Controllable Molecule Generation [63.44583084888342]
We propose a new retrieval-based framework for controllable molecule generation.
We use a small set of molecules to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria.
Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning.
arXiv Detail & Related papers (2022-08-23T17:01:16Z) - Exploring Chemical Space with Score-based Out-of-distribution Generation [57.15855198512551]
We propose a score-based diffusion scheme that incorporates out-of-distribution control in the generative differential equation (SDE)
Since some novel molecules may not meet the basic requirements of real-world drugs, MOOD performs conditional generation by utilizing the gradients from a property predictor.
We experimentally validate that MOOD is able to explore the chemical space beyond the training distribution, generating molecules that outscore ones found with existing methods, and even the top 0.01% of the original training pool.
arXiv Detail & Related papers (2022-06-06T06:17:11Z) - Hit and Lead Discovery with Explorative RL and Fragment-based Molecule
Generation [34.26748101294543]
We propose a novel framework that generates pharmacochemically acceptable molecules with large docking scores.
Our method constrains the generated molecules to a realistic and qualified chemical space and effectively explores the space to find drugs.
Our model produces molecules of higher quality compared to existing methods while achieving state-of-the-art performance on two of three targets.
arXiv Detail & Related papers (2021-10-04T07:21:00Z) - Learning to Extend Molecular Scaffolds with Structural Motifs [15.78749196233448]
MoLeR is a graph-based model that supports scaffolds as initial seed of the generative procedure.
We show that MoLeR performs comparably to state-of-the-art methods on unconstrained molecular optimization tasks.
We also show the influence of a number of seemingly minor design choices on the overall performance.
arXiv Detail & Related papers (2021-03-05T18:28:49Z) - MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization [51.00815310242277]
generative models and reinforcement learning approaches made initial success, but still face difficulties in simultaneously optimizing multiple drug properties.
We propose the MultI-constraint MOlecule SAmpling (MIMOSA) approach, a sampling framework to use input molecule as an initial guess and sample molecules from the target distribution.
arXiv Detail & Related papers (2020-10-05T20:18:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.